Gene Haur_5123 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_5123 
Symbol 
ID5737081 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009973 
Strand
Start bp165724 
End bp168573 
Gene Length2850 bp 
Protein Length949 aa 
Translation table11 
GC content57% 
IMG OID641282288 
Producthypothetical protein 
Protein accessionYP_001547879 
Protein GI159901633 
COG category[S] Function unknown 
COG ID[COG1262] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTGATA GCATTAATGG TTCTGTCGAT CTCGATAAAT CGCAAGTAAC TGGTGTGATC 
GTTGGAGTTA ATCTTGGCAC GATTATCTAT GGCCGTCCAC CAGAGGAAGC CGAGCGCCAG
CGCTTAGTCG CCTACTTGGA TCAGGTGACC AAAAGCCATA ATACGCTGCG GGTAGTCGGG
GTTGGCTCGT CCCATCTCGC GTCAGGCATT GACCTCGCAT CCGCCTATAT GATGCTGGCG
GTGCAGGGGC GGCAGCGGGT GGTGCGGACA CTGACGGCGG AAGAAATTGC GGCCCATCGC
CAGCAGCGCT TTGAAATTCC TGAGGAACTG AGCGCTGATC GCTGTTTGCC CGATCACGCC
GTGCTTGCGG TTGCCAAGCG AGATGGTTAC TTGGCGTTGC TCCGGGCGGA ACTGGCGACG
GAAACCGTGT TGGCGCATCC CTACCTCGTA TTGTGTGGCG CACCGGGGAG CGGCAAATCA
ACCTTCGCTA AGCATCTGGT GTGGGCCTTG GCGCAGCGTG GCCTTGACCA GATTAATCAT
CACACGGGCT TGCTTGGCTG GGCTGACAAA CAGCGCGTGT TGCCCGTGTT TATGCCTTTA
CGCACGTTGG CGGGTGCGTT GGTGGGCAAG GATTTAGGGT TGAACAACAC CCCCCATATT
GGGCTGTTGC TTGATGCGGT GTGTGCCCAT CTGCAAACGA CCTATGGGCT TGAGCAGCCG
CGTGAGCTTT TAAGTGCTGG ACTGGATCGC TCGCGCACGG TCTTGTTGGT GTTTGATGGC
TTGGATGAAG TGCCACTGGA AGCCACTGAC CACAGCCTTG ATCGCCGCTC GCTCTTGACC
TATGTCCGCT TGTTTGCCAA TGCCTATGCT GCTCGTATCC TCATCACCTG CCGCTCGCGG
GCCTGGACGG AGGAGTATGG ACAGATCACG CAGTGGCCAA TGGTTGAACT GGCTCCGTTG
AGCGGTGGCC AAATGACCCA GTTTATTCGC ACATGGTTTC CATTGTTGCA TGCCAAGGGT
CTGATTGATC ATGAGGCCAT TGAGCGCTAT AGTGATCAGT TGACGCAGGC GTTGCGCGAT
CCCCAGCGCC GCCGCTTACG GGACATGGCC GACAATCCGT TGCTGTTGAG TATGATGATT
TTTGTGTTGG CTCGCAAGGG TGTGTTGCCG CGTGACCGCC ATAGCCTGTA TGACGATATC
CTGAAACAAC TCTTGGGCGA GTGGGATACC ACCAGTCGCA ATGGGCAGAA CTTGGGGCAA
GCGGTTGGGG ATGATCGGAT CATGGGCGAC GAGGTGCGCG ATCAGGTATT GGATCGGTTG
TGTTATCAGG CGCATTTAAC CGCCACGTCA ACGGATGGGC GTGGACGAAT TCCGAGCCGT
GAGCTTCAAT TTGCTTTGAT GGAGTATTTC GCCCGCGTCA ACGTGGCCGA CCCCTATCGG
GCGGCGGAGC GCTGTATCGC CTATATTGAT CAATGCAGCG GCTTGCTTCA GCCGGAGGAT
GATGGGAAGG TCTATGCCTT TGCCCACTTG ACGTTGCAGG AACAGAGTGC TGGTCGCCAC
TTGGTGTTTT ATGAATCACT CGATCAGTTG TTGGCCTTAC GCCGTGATGA CCGTTGGCGC
GAACCGATCT TTTTAGGCGT TGGCTGCCTG ACGAAGGTGG GGCTTGGAAG TGCCAAAATT
GACCAACTCC TGACGACCTT GGTTGACTCC GATGCCTATG AAGCGGGAAC CATGCATCAA
TACGACTGGT ATCGCGATCT GATCTTAGCT GCTGAGTTGG GGGCGGACTG CGACTGGGGC
TTGTTGCGCG GCAAGCAGAT CAAGGTTGAT CGCATCCAGC GACGGTTGCG GGCGGGGCTG
GTTAACCTGC TTGAAGACCA CGACCATTCC CACGCAGCAC TTGCGTATCA CCACGGCCAA
GCGATGGAGC CAGCGCCGTT ATTGGTGCGC GAACGGCAAA AAGCTGCCGA ACTTCTCGCA
GGCTTGGGCG ACCCACGCTA TCCGGTAACC ATAGCGCAAT GGCAACAGGA GACGCGCGAT
CTGTCCACCC AGTTTGGCCG CGAGGGCAAC CATTATTGGC GCTACATCCC TGCGGGCCGT
TATCAGGTTG GCGGGTGGGA TGCAGACGAA CAATCCACAG TGGTTGAACT TCAGGATTAC
TGGGTCGGGC GGTTTATGGT GACGGTGGAA CAATATCGGG CGTTTATGGA GGCTGGTGGC
TATACGAATA AGGATTATTG GACGGAACAT GGATTACAGT GGAAGCAACG CGAACAACGA
ACAGAACCAC GCTGGTGGTA TGACCAAACC GAGCAAGAAT ACCGCAATCG ACCATTCTAT
GGAGTGAGTT GGTATGATGC GGTGGCCTAT TGCCAGTGGC TGACGGATCA GCTTACGCCA
TGGCTGCCGC AGGGGTATTG TATTCGGTTG GTCAGTGAGG CGGAATGGGA GGTGTCCGCT
GCCTATACCG CCGACGGACA GCGCCAACCG TTCCCGTGGG GTGAGCAGCC CGCCACGCCG
GAGCATACGG TGTACAATTG GAGCAGGGAA AAACGCCCCT TATCCGTTGG TTTAGGGCTG
GTTGGCCAAG CGGCGTGTGG CGCACTGGAT AGCGTTGGCA ACATGTGGGA GTGGACGGCC
ACGCGCGATG AGGACAACGG TGGCAATGGG CAGCAGGTGC TTGCGGATAG TGACGACCTT
ATGGTGCTGC GGGGTGGCTC AGGGTACGAA AATAGTATAA ATGTTCGTTG CGCGGCGCGT
CTCAGGAATC CTCCCGGCAA CGGCGTCACC ATTCTTGGAT TTCGTTGTAT TCTCGCCCAT
CGTACATCTG TTCTGAATCC TGAATCCTAA
 
Protein sequence
MADSINGSVD LDKSQVTGVI VGVNLGTIIY GRPPEEAERQ RLVAYLDQVT KSHNTLRVVG 
VGSSHLASGI DLASAYMMLA VQGRQRVVRT LTAEEIAAHR QQRFEIPEEL SADRCLPDHA
VLAVAKRDGY LALLRAELAT ETVLAHPYLV LCGAPGSGKS TFAKHLVWAL AQRGLDQINH
HTGLLGWADK QRVLPVFMPL RTLAGALVGK DLGLNNTPHI GLLLDAVCAH LQTTYGLEQP
RELLSAGLDR SRTVLLVFDG LDEVPLEATD HSLDRRSLLT YVRLFANAYA ARILITCRSR
AWTEEYGQIT QWPMVELAPL SGGQMTQFIR TWFPLLHAKG LIDHEAIERY SDQLTQALRD
PQRRRLRDMA DNPLLLSMMI FVLARKGVLP RDRHSLYDDI LKQLLGEWDT TSRNGQNLGQ
AVGDDRIMGD EVRDQVLDRL CYQAHLTATS TDGRGRIPSR ELQFALMEYF ARVNVADPYR
AAERCIAYID QCSGLLQPED DGKVYAFAHL TLQEQSAGRH LVFYESLDQL LALRRDDRWR
EPIFLGVGCL TKVGLGSAKI DQLLTTLVDS DAYEAGTMHQ YDWYRDLILA AELGADCDWG
LLRGKQIKVD RIQRRLRAGL VNLLEDHDHS HAALAYHHGQ AMEPAPLLVR ERQKAAELLA
GLGDPRYPVT IAQWQQETRD LSTQFGREGN HYWRYIPAGR YQVGGWDADE QSTVVELQDY
WVGRFMVTVE QYRAFMEAGG YTNKDYWTEH GLQWKQREQR TEPRWWYDQT EQEYRNRPFY
GVSWYDAVAY CQWLTDQLTP WLPQGYCIRL VSEAEWEVSA AYTADGQRQP FPWGEQPATP
EHTVYNWSRE KRPLSVGLGL VGQAACGALD SVGNMWEWTA TRDEDNGGNG QQVLADSDDL
MVLRGGSGYE NSINVRCAAR LRNPPGNGVT ILGFRCILAH RTSVLNPES