Gene Haur_2178 details


Gene Information

Locus tag: Haur_2178
Symbol:
ID: 5734065
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 2759833
End bp: 2761854
Gene Length: 2022 bp
Protein Length: 673 aa
Translation table: 11
GC content: 50%
IMG OID: 641279319
Product: coagulation factor 5/8 type domain-containing protein
Protein accession: YP_001544946
Protein GI: 159898699
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2730] Endoglucanase
TIGRFAM ID:
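
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based and inclusive (the record does not state this) and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and used only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose coordinates are assumed 1-based and inclusive."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values copied from the record for Haur_2178.
length = gene_length_bp(2759833, 2761854)
print(length)                     # 2022, matching "Gene Length"
print(protein_length_aa(length))  # 673, matching "Protein Length"
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields in the record.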


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00232015
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAGT TGAGTAGGCT GATTATTGTT CTTGGTCTTA TTTGTAGTAT TTTTGCCCAA 
CATTCAGCAA CGCCCAAAGC CCAAGCTGCG TTTGGGGCTA GCGATTTTCT CAAGGCCAAC
GGCGCGACAG TGCGCAATAA TTCTGGTAAT GGGGCAATCG TCACGCTCAA AGGCACCAAC
CTCGGCGGTT GGTTGTTGCA AGAAGGCTGG ATGTCGCCCT TGGGATATCC CGCTTTACCG
CGCACCAGCT GGACTGCCAG CGGATCAGCT GGTGGAGCTG CTGCCGCAAT CGATGGCAAT
CCGGCCACGC GCTGGACGAG CAACGCTCCC CAAGCTAATG GTCAATGGTT CCAAGTCGAT
CTTGGCGGCA ACCAAGCCGT CGAACGGGTG ACGATCGACG CTGGCTCCTC AACGGGCGAT
TATCCGCGCC AATATCAAGT TCAGGCTTTT GTTAATAATG CTTGGCTCAC AGTTGGCAGT
GGCAGCGGTA CAAGCCAAGT GGTAACCGTC CAATTCAATA ACACTCAAGT AACCCGCCTG
ATTCGCGTAT TGCAAACTGG CTCAAGCGGC AGTTGGTGGT CAATTCACGA ATTCAATGCT
CAAATTGCTG ATGAATTTAA TTTGCGCCAA GCCTTGACCA ACCGCTTTGG CACAAGCACT
ACCGATAGCT TGATCAACGG CTACCAAGAT ACTTGGATTC AAGCCAGCGA CCTCGATACG
ATTAAAGCTA TGGGCTTAAA CATGGTGCGC GTGCCGATTC ATTGGCTGGT GCTGATGAAC
ACTAATGGCA CGATGAAATC GGATACTGAA TCGTTCCGCA AGCTCGATTG GCTGATTAGT
GAAAGTAGCA AGCGCAATTT ATATGTGATG CTCGATTTGC ATGGCGCTCC TGGTGCTGCT
TGTCCATGGC ATTCATGTGG TCAAACTGGC ACCAACCAAC TCTGGACTAA CCCAACCTAC
CAAAATTGGA CGGTGCAAAT TTGGGAACGC TTGGCGACAC GCTATCGTGG TAACCCAACT
GTGGCCGCCT ACGATTTGCT CAACGAGCCA TTGCTGAGCA ACGGCGCAGC CGAAAACGAG
CAACAGGTGC GCCAAAAATT TGATTTCTTT GATCGTTTGT ATGATGCTGT TCGCGCCAAA
GACCCCGATC ATATGATTGT GATGGCAGCT TTCTATGATT GGTACCAAGC GTTATCGCCT
GCAACCTATG GCTGGACGAA TGTGATGTAT CAATTACACC ACTACAACTT TGATACGGTC
ACTGATTGGA ATGTAACCAA TAATTTCATT CAAAGTGCCT TGGATAAATA CGCCACCTTC
ACCAAGGATT GGAATGTGCC TGGCTTTGCT GGCGAATATT GGTTCTCAAC TCACTACGAT
CTGTATGAAA AATTTATGTC TGGCTTGAAT GCCTTGAATG TTTCATGGAC CAACTGGACA
TACAAGGTCA ATGGCGGCGG CAACTGGGGC TTCTATCAAA ACAATACCCA AGCCGTACCA
GATCTATTAA ATGATAGTGC TGCCACGATC GCCGATAAAT GGTCACGCTT CAGCACCAAT
TATTTCCAAC CAAACACCCA GTTTCAAAAT ACGGTGCGAG CTTATGCGCC GGAAGGTTCG
TGGGTTGCGC TACAAGCTGG AGCCAATAAT AGCTATGTTA GCGCCGATAA CTATGGCAAC
AATCCCTTGG TTGCCAATCG CCCAAGCATC CAAGGCTGGG AAAAATTCCG CATGATCACG
AATCCCGATG GGACGGTTTC GTTTATGTCG CTGGCCAACA ACAAATATGT GGCCGCCGAT
TTGAACAACG GTGGGCGCTT GATCGCCCAA TCACGCGGGG TATTGGGCTG GGAAAAATTC
CGCCGCGTTG ATCTTGGCAA CGGAACCTTT GGCCTCCAAG CAATCGCTAA CAATAAATAT
GTCACCACTG ATCTGAATAG TGGCTCGCCT ATGTTGATTG CCAATCGCGA TGCGATCGGC
GGCGCATGGG AAGCCTTCAC CTTCGTTGCG ACTGCTCCAT AG
 
Protein sequence
MKKLSRLIIV LGLICSIFAQ HSATPKAQAA FGASDFLKAN GATVRNNSGN GAIVTLKGTN 
LGGWLLQEGW MSPLGYPALP RTSWTASGSA GGAAAAIDGN PATRWTSNAP QANGQWFQVD
LGGNQAVERV TIDAGSSTGD YPRQYQVQAF VNNAWLTVGS GSGTSQVVTV QFNNTQVTRL
IRVLQTGSSG SWWSIHEFNA QIADEFNLRQ ALTNRFGTST TDSLINGYQD TWIQASDLDT
IKAMGLNMVR VPIHWLVLMN TNGTMKSDTE SFRKLDWLIS ESSKRNLYVM LDLHGAPGAA
CPWHSCGQTG TNQLWTNPTY QNWTVQIWER LATRYRGNPT VAAYDLLNEP LLSNGAAENE
QQVRQKFDFF DRLYDAVRAK DPDHMIVMAA FYDWYQALSP ATYGWTNVMY QLHHYNFDTV
TDWNVTNNFI QSALDKYATF TKDWNVPGFA GEYWFSTHYD LYEKFMSGLN ALNVSWTNWT
YKVNGGGNWG FYQNNTQAVP DLLNDSAATI ADKWSRFSTN YFQPNTQFQN TVRAYAPEGS
WVALQAGANN SYVSADNYGN NPLVANRPSI QGWEKFRMIT NPDGTVSFMS LANNKYVAAD
LNNGGRLIAQ SRGVLGWEKF RRVDLGNGTF GLQAIANNKY VTTDLNSGSP MLIANRDAIG
GAWEAFTFVA TAP
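
The sequences above are printed in space-separated blocks of 10 residues, 60 per line. The following is a minimal sketch of how such a listing could be rejoined into a contiguous string before checking its length or base composition. The helper names `join_blocks` and `gc_fraction` are hypothetical, and the GC calculation assumes an unambiguous A/C/G/T alphabet.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace (spaces and line breaks) from a blocked sequence."""
    return "".join(text.split())


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 if empty)."""
    seq = seq.upper()
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied from the record.
first_line = (
    "ATGAAAAAGT TGAGTAGGCT GATTATTGTT CTTGGTCTTA TTTGTAGTAT TTTTGCCCAA"
)
joined = join_blocks(first_line)
print(len(joined))          # 60 bases in one full line
print(gc_fraction(joined))  # GC fraction of this line only, not the whole gene
```

Applying `join_blocks` to the full gene and protein listings should give strings whose lengths can be compared with the Gene Length and Protein Length fields.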