Gene Haur_2193 details

Gene Information

Locus tag: Haur_2193
Symbol:
ID: 5734080
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 2779227
End bp: 2781599
Gene Length: 2373 bp
Protein Length: 790 aa
Translation table: 11
GC content: 52%
IMG OID: 641279334
Product: carbohydrate-binding family V/XII protein
Protein accession: YP_001544961
Protein GI: 159898714
COG category:
COG ID:
TIGRFAM ID:
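The start, end, and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the reported gene length includes the terminal stop codon; neither convention is stated in the record itself, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    length = gene_length_bp(2779227, 2781599)
    print(length)                     # 2373
    print(protein_length_aa(length))  # 790
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.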


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00201516
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAACGTT TTTTTTCATT GATGTTGGTA TTTAGTTTAT TTGCGAGTCT CTTATTCAAC 
CAAGCTACTC ACGCCGCACC AAACCCATGG GCAGCCAATA CGGCCTATGC GGTTGGCAGC
CAAGTTAGTT ACAATGGCAC GATCTATGAG TGTTTACAAG CCCACACGGC CTTAGTTGGT
TGGGAGCCAG CCACCACACC CGCGCTCTGG AAGCAGGTTA GTACTGCTCC AAGCGCTACG
ACAATTCCCG CAACAAGCAT TCCGCCAACC GCTCCCCCTG CCACAAATAT TCCGCCAACT
GCAACCGCAA CGCCGACGGC TGGCTGTTTA AATCCAGCGG CGGGCGTTGG TTGGCAAAGC
CGCAGTTTAA GCAACCAAAC TGGGAATTTT AGTTTGAGCT TTGAGGCAAC GCTGAGCGCT
AGCCCCACCA ATGCTGTGAT TGGCTTGGCC AATGCTGTGC CAACCGATTA CACTGGTTTA
GCAATTGCTG TGCGCTTTAA TCCAACTGGC TCAATCGATG CCCGCAACGG CGGAACTTAT
GCTGCTGCTA CCAATGTGGC CTACAGTGCT AATACACGCT ATCGTTTTCG GATTGTTATC
AATCTTGTGG CTCATACCTA CAGTGTTTAT GTTACGCCGC AGACTACTAG CGAAGTTTTG
ATTGCCAGTA ATTTTGCTTT TCGCAGCGAA CAAGCCAATG TGAGCAATTT AAACACCTTG
GCGACGGTGG TTGGTGGCAC AGGAGCCGTT GGCTCACTCA ATCTTTGCAA TATCAGCCTG
AATCTGCCTG CAACTCCAAC GCCGCTGCCA ACCGCAACGC CAACCCCACG ACCAACCGCA
ACCCCAGTGC CAACCGCAAC TCCAGCGCCA ACCGGTTATA CCCCAGCTTT TTGTGCCAAC
TATCCACCAG CAATCGTCGC AGGCAGTTGG CAATCATCGG TGGTCAGCTA TCACAATGGG
CGTTTGCAAT ATACCAACGA TAGTGCCCAA AACCGAATTC CTGATTTTAG CTATGCTGGC
TATTATTCAG GCCAACGGCC ACTGCCAAAT CTGGCGGTTG TCCAAACACT CAGCCCAATC
AGTGGCGATA ATACCGCCCG CATTCAACAG GCACTCGATG CAATTGGCAA TCGCACGCCC
GATGCCAATG GTTTGCGTGG AGCATTATTG CTTGCACCTG GCCGCTACAA CATCAACGGA
ACCTTACGCA TCAACAAAAG TGGCGTGGTG CTGCGCGGCA GTGGCGATGG CAGCGATGCT
AGCACTTCCA CGATTTTGCT AGGAGTTGGC AACACGCCGC ATCAACGCAC ATTAATTGTG
GTGGGCAACG GCGATTCAAC CCCGTGGACG GCTGGCTCCG CCACCAACGT GACCGACCAA
TTTGTACAAG TTGGCAGCAA AAGCTTGAAT GTAGCCGATC CCAGTCGTTT TACGGTTGGC
CAAGAAGTGA TTGTGCGCCA CCCATCATCA CAAGCATGGA TCAACGCGGT CAATGGTGGC
GGGGTAGTCA ATGACGCTTG GTGGGCGGTC GGTGCTTTAG ATATGACTTG GACGCGCCGA
GTAACTAAGA TTGCTGGTAC AACCCTGACG CTTGATGCTC CAATTTTCAA TCATTTGGAT
CGGGCGTTGA GCCAAGCGAC GGTTGCTCCG GTTGCCAGCC GCACTATCAT CGCCAATGCT
GGGGTAGAAA ATCTGCGGGT TGATATTCAG ACCGCTGGCG GCGAGGATGA GAACCACGTT
TGGGATGCAA TTGGGATTGT CGGGGCAGAA AATAGCTGGG TCAAAAATGC GACAGTCTTG
CACTTTGGCC ATGCTGGGGT GTTTACTCAA GGCGCAATTC GCATCACGGT TGAAGATGTG
CAGGCACTTG ATCCAGTTGG CATTCGGACT GGTGGCCGTT TTTACAACTT CGATGCCGAA
TCGAATAGCC AACTCGTGTT GTTTACGCGG GTTCATGCCA CTGGCGGTCG CCACAACTTT
ATTTCCAATG GAACTCAAAC CACCTCGGGG ATTGTTTGGC ATCGTTCGAC TGAAGGCGGC
GGCTCGGATA GCGAAGGCCA TCGTCAATGG AGCCAAGGTC TATTGTTCGA CACCATTAAT
GCTAGTGCCG CCAGCAATAT CAAGCTGATC AACCGTGGCG ATTATGGCAC ATCGCATGGC
TGGGGCAATG TGCATTCAGT CATCTGGAAC TACAATCGCA CGATGATGGT GCAAAAGCCG
CCAACTGGCC AAAACTATGT CATCTCACAG GCTGGCACGC GTAGCACCTC GTATCCCTTC
CCGGGCGCTG GTGGTTTTGC CGATATTCGC AGCGGCAGTT TAGTACCCAA TTCGCTCTAC
GAAGCTCAAC TTTGTGATCG GCTAGAACAG TGA
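The gene sequence is displayed in space-separated blocks of ten bases, so it needs to be cleaned before any computation. The sketch below shows one way to strip the layout whitespace and compute a GC fraction that could be compared with the GC content field of the record. The helper names are hypothetical, and the record does not say how its own GC value was rounded.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


if __name__ == "__main__":
    # First display line of the gene sequence, used only as a small demo.
    first_line = "ATGCAACGTT TTTTTTCATT GATGTTGGTA TTTAGTTTAT TTGCGAGTCT CTTATTCAAC"
    print(len(clean_sequence(first_line)))  # 60 bases
    print(round(gc_content(first_line) * 100), "% GC in this line")
```

To check the whole gene, pass the full sequence text to `gc_content`; a single line is not expected to match the gene-wide figure.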
 
Protein sequence
MQRFFSLMLV FSLFASLLFN QATHAAPNPW AANTAYAVGS QVSYNGTIYE CLQAHTALVG 
WEPATTPALW KQVSTAPSAT TIPATSIPPT APPATNIPPT ATATPTAGCL NPAAGVGWQS
RSLSNQTGNF SLSFEATLSA SPTNAVIGLA NAVPTDYTGL AIAVRFNPTG SIDARNGGTY
AAATNVAYSA NTRYRFRIVI NLVAHTYSVY VTPQTTSEVL IASNFAFRSE QANVSNLNTL
ATVVGGTGAV GSLNLCNISL NLPATPTPLP TATPTPRPTA TPVPTATPAP TGYTPAFCAN
YPPAIVAGSW QSSVVSYHNG RLQYTNDSAQ NRIPDFSYAG YYSGQRPLPN LAVVQTLSPI
SGDNTARIQQ ALDAIGNRTP DANGLRGALL LAPGRYNING TLRINKSGVV LRGSGDGSDA
STSTILLGVG NTPHQRTLIV VGNGDSTPWT AGSATNVTDQ FVQVGSKSLN VADPSRFTVG
QEVIVRHPSS QAWINAVNGG GVVNDAWWAV GALDMTWTRR VTKIAGTTLT LDAPIFNHLD
RALSQATVAP VASRTIIANA GVENLRVDIQ TAGGEDENHV WDAIGIVGAE NSWVKNATVL
HFGHAGVFTQ GAIRITVEDV QALDPVGIRT GGRFYNFDAE SNSQLVLFTR VHATGGRHNF
ISNGTQTTSG IVWHRSTEGG GSDSEGHRQW SQGLLFDTIN ASAASNIKLI NRGDYGTSHG
WGNVHSVIWN YNRTMMVQKP PTGQNYVISQ AGTRSTSYPF PGAGGFADIR SGSLVPNSLY
EAQLCDRLEQ
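The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is a plain-Python illustration, not the method used by the source database. It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code and differs in its permitted start codons, which this sketch does not model. `CODON_TABLE` and `translate` are hypothetical names.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G with the
# third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw_dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is removed first; any character other than A, C, G, T
    raises KeyError. Trailing bases that do not form a full codon are ignored.
    """
    dna = "".join(raw_dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence give the first 10 residues.
    print(translate("ATGCAACGTT TTTTTTCATT GATGTTGGTA"))  # MQRFFSLMLV
```

The first ten residues produced this way match the start of the protein sequence, and the final codons `GAA CAG TGA` translate to the terminal `EQ` followed by a stop.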