Gene Haur_4658 details

Gene Information

Locus tag: Haur_4658
Symbol:
ID: 5736505
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 5952724
End bp: 5955108
Gene Length: 2385 bp
Protein Length: 794 aa
Translation table: 11
GC content: 53%
IMG OID: 641281822
Product: mucin 2
Protein accession: YP_001547417
Protein GI: 159901170
COG category: [R] General function prediction only
COG ID: [COG3889] Predicted solute binding protein
TIGRFAM ID:
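
The gene and protein lengths listed above are consistent with the start and end coordinates. A minimal sketch of that arithmetic follows, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. The function names are illustrative only and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(5952724, 5955108)
print(length)                     # 2385
print(protein_length_aa(length))  # 794
```

Both values agree with the Gene Length and Protein Length fields of this record.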


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0296531
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGCTGCT CATCATTAAT GGTTGTGCGC TGGACAACTC CGGTTAATGC TTCTGGGCCT 
GAGGTCTTGC TGCCACCATT TGCCAATTAT GAGGCTTGGA ATGGCACAGC ACCATTTCCG
ATTGGACAAG AAGCGATTCG TAGCAAAATT AGCGATCAGC CAGCTAACCC AAATTTAGTT
GCAGCCGAAA CGCCGATTCC TATTGGGATT GTTGATTTGT TGCCAACCGC CACGCCAACT
GGCCGACCTA CGCCAGCAAA TGAAGCAATT GTAACCTCAA CTACAACGAC TACCCCTACT
CCCACCAAGG ATGAGGGTAT TCCGATTGGT AGTGCAACGG CGGCAACGAC CGAGCCAACT
ATCGAATTAA CTGCCGAACC AAGCTCAACG CCACGGGTTA ATCCAAGCTC GACGGCTGGT
ACGCGCACGC CAACCCTGAA TCCAACGATT AATCCAACGC GACCAACCTT TGAACCAACC
TTTACGAGCA CGCCAACCCG AACGAGTGTT GTGGTGGCTA CTACGGTTGT GCCAGCCAGT
GCAACCTCAA CCAATACTGC AATTCCAACG TCAGCGCCAA CAGCGACCAA CACGCGCGTG
CCTACGATCA CGCCAGTGCC AGCGACGCTG ACCAACACGC CAACCGACGT AGCAACGATC
ACACCAGTAC CAGCAACGCC CAGCAACACG CCAACCGACG TGCCAACCAA TACGCCTGTG
ATCGTTACGG CGACCAATAC GCCAATTCCA ATTTTTACTA CGCCGAGCAA CACGCCAACC
AATACGGCGA CCAATACGCC GAGTAATACG CCAACCAATA CGGCGACACC GACGAATACA
CCGAGCAACA CGCCAACCGA TACGGTGACA CCAAGCAATA CGGCTACGCC ATCGAACACA
CCAACGCCAA CCGATACGGC TACGCCAACT GCCACGCCAG CACCAGAGCT GTATATTGCT
TGGCGCGTTG ATGCGATTGT CAATCCAGTT AATCCATCGA TGGAAAATAA TGATACCAAA
CAGGTATTCG TGGTGTTTGG CAATGCTGGC GATGCTCCGG CATCAGGGGC GCAAGTTAAT
ATTAGCGTAA CTGGAACTTG TATTAGCTCA AGTATTAGCA GCAGTGGCGT ACCAATTACG
CTTGGGGCGC ACCAAGGCTT TACGCTTTCG CCCACAATTT CAGCCAATAA TGTTGGCAAT
TGTTCGATTA CCGCAGTATT AACGGCGATT GGTCAAACCC CAGTTCAAGC AACCTTGAAT
TGGACGATTG TGTGTGATGG TTGTGCTACT GTAACCCCGC AACCAACCAA CACGCCAACC
CGTACACCAA CCAACACGCC AACCCGCACA CCAACACCAA CCAATACTGC GACTCCATCG
AACACGCCAA CGCCAAGCAA CACGCCAACG GCGACCAATA CTCATACGCC AACGACAATT
CCAACCTTGA CCTATACGCC AACGCCCAGC AACACGCCGA CGGTCACCAA CACGCCAACG
CCCAGCAACA CACCAACCAA TACGCCAACG CCGAGCAACA CGCCGACGGT GACCAATACG
CGCACACCAA CCAATACACC AACAATTACC AACACGCCAA CGCCGAGCAA CACGCCAACG
GTGACCAACA CGCCAACGCC AACTAGTACG CCAACGCCAA CTAGTACGCC AACGGTGACC
AATACGCCCG TTGATACACC AACGCCAAGC GAAACACCAA CGCCGAGTGA AACGCCAACA
CCAACTAGTA CGCCAACCCC AACTCCAGAT CTGTTTGTGT TCCACATCGT CAATGGTGTG
GTTAATCCAG CAGGCTCGAT GACATTGGCG GCTGGGGCGC AAGAAAATGT AACGGTTGTG
TTTGGCAATA ATGCTAGCGG CTCGCGGGCA ACTGGCTTGA ATTTCAGTTT CAGTGGCGGG
GCATGTATTA GCGCTCAGCC TGGCTCAAGC GATTCAAGCG ATTTGAATGG CGGCGTAAAT
CGCTCGTTAT CAGTCATTGT GACGGGTAAT GCAGTTGGCT CATGTTCGTT CCGCACTCAA
TTTAGTGCCA GCAACGCCAA TACCGTCAGT GTTGATAGTA GTTTTACCGT GGTCAATACC
GCACTAAATC AACCAGCGAT TGCTGCAACT GCCACGGCAA CTTCAACTGC CACAGCTACG
GCTACCGCTG AACCAACGGC AACTCTTGCG CCAACCGCTA TGCCAACCGA TCAACCAACA
GCTGAGCCAA AAGCGACGCT CGAACCAACC TTGCCAGTAA TTGGTCAAAG CTCAGGGTTT
CCACCAATGT CGGGTGGCCA AATGCTTTGG TTGCTGGCGG GCGGGCTAGC TATTCTGCTC
AGTGGCTTGC GCGGGCGAAG AATCTTACCG CTTAACGTTG CCTAA
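
The gene sequence is printed in space-separated blocks of 10 bases, so it needs to be cleaned before any computation such as GC content. The sketch below is one possible way to do that, assuming the sequence contains only A, C, G and T; `clean_sequence` and `gc_percent` are hypothetical helper names. The example uses only the first 30 bases of the gene, so its result is not the whole-gene GC content reported in this record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


first_blocks = clean_sequence("ATGGGCTGCT CATCATTAAT GGTTGTGCGC")
print(len(first_blocks))         # 30
print(gc_percent(first_blocks))  # 50.0
```

Applying the same two functions to the full 2385 bp sequence would give the value to compare against the GC content field.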
 
Protein sequence
MGCSSLMVVR WTTPVNASGP EVLLPPFANY EAWNGTAPFP IGQEAIRSKI SDQPANPNLV 
AAETPIPIGI VDLLPTATPT GRPTPANEAI VTSTTTTTPT PTKDEGIPIG SATAATTEPT
IELTAEPSST PRVNPSSTAG TRTPTLNPTI NPTRPTFEPT FTSTPTRTSV VVATTVVPAS
ATSTNTAIPT SAPTATNTRV PTITPVPATL TNTPTDVATI TPVPATPSNT PTDVPTNTPV
IVTATNTPIP IFTTPSNTPT NTATNTPSNT PTNTATPTNT PSNTPTDTVT PSNTATPSNT
PTPTDTATPT ATPAPELYIA WRVDAIVNPV NPSMENNDTK QVFVVFGNAG DAPASGAQVN
ISVTGTCISS SISSSGVPIT LGAHQGFTLS PTISANNVGN CSITAVLTAI GQTPVQATLN
WTIVCDGCAT VTPQPTNTPT RTPTNTPTRT PTPTNTATPS NTPTPSNTPT ATNTHTPTTI
PTLTYTPTPS NTPTVTNTPT PSNTPTNTPT PSNTPTVTNT RTPTNTPTIT NTPTPSNTPT
VTNTPTPTST PTPTSTPTVT NTPVDTPTPS ETPTPSETPT PTSTPTPTPD LFVFHIVNGV
VNPAGSMTLA AGAQENVTVV FGNNASGSRA TGLNFSFSGG ACISAQPGSS DSSDLNGGVN
RSLSVIVTGN AVGSCSFRTQ FSASNANTVS VDSSFTVVNT ALNQPAIAAT ATATSTATAT
ATAEPTATLA PTAMPTDQPT AEPKATLEPT LPVIGQSSGF PPMSGGQMLW LLAGGLAILL
SGLRGRRILP LNVA
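
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below assumes that the amino acid assignments of translation table 11 match those of the standard genetic code (the tables differ in their permitted start codons, which does not matter here because the gene begins with ATG). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the standard amino acid string.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 15 bases of the gene sequence in this record.
print(translate("ATGGGCTGCT CATCATTAAT GGTTGTGCGC"))  # MGCSSLMVVR
print(translate("CTTAACGTTG CCTAA"))                  # LNVA
```

The two outputs match the first ten and last four residues of the protein sequence listed above.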