Gene Haur_3953 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3953 
Symbol 
ID5735814 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4958928 
End bp4960721 
Gene Length1794 bp 
Protein Length597 aa 
Translation table11 
GC content44% 
IMG OID641281103 
ProductTn7-like transposition protein D 
Protein accessionYP_001546713 
Protein GI159900466 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCTTCG CCTCAGCTAA CATTGCTGTT GAAGAACTTT TTGGCACACG AAATATTGTA 
GCTGTTGTTG ATCTTCCTAG CCGACTAGAT ATTTTATCCA ATCGGCTACC CAAGGCATTA
GGTCTTAATT CGGAACTACT TATCAGCAAT CACACGTCAT TGCCTTTGTT TGCGCCTTTT
TTATCTGGTG ATGCTCTCCA CAGAGTCCGA CGTTCAATGC TCGGCAATAA AGGGGGAACA
GTACATTCGC GCATTGGTAT TATGGCTAGT GGAGTCAAGA GCCTCAGAAA GCTGCGTTTT
TGTCCAGTGT GTGTCAATGA AGATCGGTCA AAATATGGAG AATGTTATTG GCATCGGGTT
CATAATATCG CAGGAATCAA TGTTTGTGCC CATCACGGAT GCATTTTACA AGATAGCATT
CTTTCATTTC AACGACAGAG CACCTATTAC GCCTACAACG CAGCAGAAGA TTACATACCA
GAGATAGAGT TTGCAGATCC GACTCCAGCA AATCTGCATC TCTATGCTCT TGCCAAGCAA
GCCTATTGGC TCTTAAACAA TATATCTAAT GCAGATCTGA AGGATATGTA TAATGTCTAT
CGAAGATGCC TTGATTTAAG AGGGCTTACG ACGGCATCCA ATAGCATTCG TCAGCAGCAA
CTTCAATCCA CATTCATAGA TTTCTACACG CCTGAATTGC TTGATACTAT ATCTTGCTCT
TTAGAAGATG TTCATCAAAC ATGGTTATCC CGCCTAGTTG GCGGAGATTA CGTAGCACAT
CATCCTCTTC GTCATCTTCT GGTACTTCAT TTCTTTGGCC ACACGCCGGA GAGCGCATTG
AAGAGTCTTG ATGAACCTAC GCCCGATCCA TTCGGTCACG GCCCTTGGCC ATGTCTTAAT
CGAGTTTGTC CACATTCACA CAAAGATGTT ATTGAAACAT GTAATGTAAG GCGTAGTCGG
TATACCCGTG ATCGGCCAAT TGGGACATTT GCTTGTCCAT TTTGTGGATT CAGCTATATA
CGGACAGGGC CGGATCAATC TCAGAACTCG CGATATCATA TTAGTAAAAT CCAGGAATTT
GGCTTCATAT GGGAAAAGAG ACTGGTTGAA CTGTTCGCTC AACAAATCCT GAGCCTACGT
GCTATAAGCA AGATCTTGGG CGTTGATCCC AATACTGTCA AACGTCATGC ACAACGACTT
ATGGATACTG GTGATAAAAC CTTAAATAAA GACCTATCTA ATACTAATGC TAGCTTAGAT
GCGACTGATC TTCTAATATC ACATCGACAC ACATGGCTTA AAGCGATTAA AGTTTTTCCT
CAATTTGGTG TATCCCAATT GAGAAAACGC TTTAGCACGG CCTATGCTTG GCTTTACCGA
AATGACAAAG CTTGGCTTTT TCTCCATACC CCAGCCAGAT CTAGTCGGAT TTTCCGTACG
AATCGAGTAT CATGGCCAGC CCGGGACAAA TTGCTACTAC AAGCGGTGCA AGAGGCAGCA
ACCAGAATTT TAAACCAGGA ACAATGGCCT AAACAAATGA CCTTAGGTGC TATTGGAAGA
GAGGCAGCGT GTCTCCCGAT CCTACAGCGA CATTTAGATA AGCTGCCGAA AACGGCAGAT
CGTCTGAAAG ATTTGGTGGA AACGAGAGAG GCGTGGGCTG TAAGGCGGCT CCAATGGACT
ATTAACCAAG CACAAAATCA AGGACTCATT CTCAAGGGCT GGGAAATTGT TCAACGCTCA
GGCTTAGGAC GTTTCCCCAA GAAATGGTTT GAACATCATG GTATACAGTT TTAA
 
Protein sequence
MRFASANIAV EELFGTRNIV AVVDLPSRLD ILSNRLPKAL GLNSELLISN HTSLPLFAPF 
LSGDALHRVR RSMLGNKGGT VHSRIGIMAS GVKSLRKLRF CPVCVNEDRS KYGECYWHRV
HNIAGINVCA HHGCILQDSI LSFQRQSTYY AYNAAEDYIP EIEFADPTPA NLHLYALAKQ
AYWLLNNISN ADLKDMYNVY RRCLDLRGLT TASNSIRQQQ LQSTFIDFYT PELLDTISCS
LEDVHQTWLS RLVGGDYVAH HPLRHLLVLH FFGHTPESAL KSLDEPTPDP FGHGPWPCLN
RVCPHSHKDV IETCNVRRSR YTRDRPIGTF ACPFCGFSYI RTGPDQSQNS RYHISKIQEF
GFIWEKRLVE LFAQQILSLR AISKILGVDP NTVKRHAQRL MDTGDKTLNK DLSNTNASLD
ATDLLISHRH TWLKAIKVFP QFGVSQLRKR FSTAYAWLYR NDKAWLFLHT PARSSRIFRT
NRVSWPARDK LLLQAVQEAA TRILNQEQWP KQMTLGAIGR EAACLPILQR HLDKLPKTAD
RLKDLVETRE AWAVRRLQWT INQAQNQGLI LKGWEIVQRS GLGRFPKKWF EHHGIQF