Gene Haur_3105 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3105 
Symbol 
ID5734977 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp3917677 
End bp3919575 
Gene Length1899 bp 
Protein Length632 aa 
Translation table11 
GC content53% 
IMG OID641280249 
Productpeptidase M14 carboxypeptidase A 
Protein accessionYP_001545871 
Protein GI159899624 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000214938 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGCCGT CACGCATCGT TCGGTTGGTT GGTTCACTCG CATTAGCCGC AGGGCTGATG 
GCTCCGTTGA GTGCTTTGGG GCAAACACGG CAGCCAGTTC AGCAAACGGA GCCGCTTGAT
CAGGCGCGGG CCTATCATCT TGAAGGCGTA ACCACGCGCG AAGATCGCAA TGCAATTGCC
GCAACTGGTG CTTCAATTGA TGCAGTTCAT GGCAAGGTGT TGGATATTAC CGCCAATGCC
GAAGAAGCTG CGGCGATTGA GCGCTTAGGC TTTAAATTGG TCGAGCTACC TGAACTGACC
GATTTTCCAG GCGCAGATTC GGCCTACCAT AATTATGCTG AGATGACCAG CAATATTGCG
GCAGTTGTTG CCAGCAAGCC GAGCATTGTG AGCCGCTTTA GCATTGGCCG CTCGTATGAA
AATCGCGATT TGATTGCGGT TAAAATTAGC GATAATGTCG CAACCGATGA GAACGAGCCA
GAAGCCTTGT TCATCGGCCA GCACCATGCC CGCGAACACC TGACCGTCGA AATGACCCTG
TATCTGTTAC ATTTGCTGGT CGATAACTAT GGCATTGACA ATCGGATTAC CAACATTGTC
AATAGCCGCG AAATCTACAT CGTTTTCAGC TTGAACCCTG ATGGCAGCGA ATACGACGTA
GCAACTGGCA GCTATCGCAG CTGGCGCAAA AATCGCCAAC CCAACAGTGG CTCTTCCTAC
GTTGGCATCG ACCTTAACCG CAACTATAGC TACAAATGGG GCTGCTGTGG TGGCTCAAGT
GGCTCAACCT CGAGCGATAC CTATCGGGGC ACGGCAGCCT TTACCGCTCC CGAAACCCAA
GCGATTCGTA ATTTCGTCGC TAGTCGGGTA GTTGGCGGTA AACAACAAAT CAAAACCTCG
ATTTCATTCC ATACCTATAG CGAACTGGTG TTATGGCCAT ATGGCTACAC CTACGATGCC
TATCCGAGCG ATATGGTACG CGACGATTAT GATGCAATGG CCGCGCTTGG CCGCACTATG
GCATCGAGCA ATGGCTATAC ACCACAACAA TCCAGCGATT TGTATGTTGC TGATGGCACC
TATGAAGATT GGGCGTATGG GGTGCATCGA ATCTTTGCCT ATACCTTTGA AATGTATCCT
CGCTCTTCGA GTCCAGGTTT CTACCCACCT GACGAAGTGA TTAGCCGTGA AACCACCCGT
AACCGTGAAT CGGTCTTGTA TTTGTTGGAA CAAACCGATT GTCCCTATCG CGTGATTGGC
AAAGAAGCTC AATATTGTAG CGGCGGTGGC ACGCCAACCC CAACCGCGAC ACCTGGGCCA
ACCGCAACCC CAGGCCCAAC CGCTACACCA AACCCAGTCG TCACCGTATT TAGCGACGAT
TTTGAAGCCA ATCAAGGCTG GACAACCAAT CCCAATGCGA CTGATAGCGC AACCACCGGC
GCATGGGAAC GGGGCGACCC TGAAGCAACC GATAGCAGCG GAGCCAAGCA GCTTGGCACA
ACTGTCAGCG GCAGCAACGA CCTTGTAACG GGCCGTTTGG CTGGCAGTTC AGCTGGAGCC
TACGACCTTG ATGGTGGCTC ATCGTCAGTC CGTTCGCCAG CCTTCACCTT GCCAAGTTCT
GGTAATTTGA GCTTGAGTTT CAGCTACTAC TTGGCTCATG GCTCGAATGC CAGCAGCGCC
GATTACTTCC GCGTGTCGCT CGTCACCAGC TCTGGCACGG TCAAAGTCTT TGAAAAATTG
GGCAGCGCAA CCGATGTTGA TGCCGCCTGG ACAGCCGCTA CTGTTAGCTT AAACAGCTAC
GCTGGTCAAT CGGTGCGAAT CTTGATTGAA GCTTCCGATG CCAGCACTGC TAGCTTGGTA
GAAGCTGCTG TGGATAATGT TAGCGTGACC CAACGCTAA
 
Protein sequence
MKPSRIVRLV GSLALAAGLM APLSALGQTR QPVQQTEPLD QARAYHLEGV TTREDRNAIA 
ATGASIDAVH GKVLDITANA EEAAAIERLG FKLVELPELT DFPGADSAYH NYAEMTSNIA
AVVASKPSIV SRFSIGRSYE NRDLIAVKIS DNVATDENEP EALFIGQHHA REHLTVEMTL
YLLHLLVDNY GIDNRITNIV NSREIYIVFS LNPDGSEYDV ATGSYRSWRK NRQPNSGSSY
VGIDLNRNYS YKWGCCGGSS GSTSSDTYRG TAAFTAPETQ AIRNFVASRV VGGKQQIKTS
ISFHTYSELV LWPYGYTYDA YPSDMVRDDY DAMAALGRTM ASSNGYTPQQ SSDLYVADGT
YEDWAYGVHR IFAYTFEMYP RSSSPGFYPP DEVISRETTR NRESVLYLLE QTDCPYRVIG
KEAQYCSGGG TPTPTATPGP TATPGPTATP NPVVTVFSDD FEANQGWTTN PNATDSATTG
AWERGDPEAT DSSGAKQLGT TVSGSNDLVT GRLAGSSAGA YDLDGGSSSV RSPAFTLPSS
GNLSLSFSYY LAHGSNASSA DYFRVSLVTS SGTVKVFEKL GSATDVDAAW TAATVSLNSY
AGQSVRILIE ASDASTASLV EAAVDNVSVT QR