Gene GSU1374 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU1374 
SymbolhylB 
ID2687929 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp1501915 
End bp1503540 
Gene Length1626 bp 
Protein Length541 aa 
Translation table11 
GC content64% 
IMG OID637126049 
Productmethyl-accepting chemotaxis protein 
Protein accessionNP_952427 
Protein GI39996476 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0840] Methyl-accepting chemotaxis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGCTGC AGAACCTGAA AATCGGCACC AGACTGTACG GATTGATCGG CTTCATGTCG 
ATCCTGTTGA TCGTTATCGG CGCACTGGGG CTCAACACCG CGAGAACAGC CAACAACGGT
CTGGACACGG TGTACAGGGA CCGCGTGCTC CCCCTGAAGG ATCTGAAAAT CATTGCAGAT
ATGTACGCGG TGAACATCGT GGACGTGTCC CACAAGGTGC GCAACGGCAA TATCACCTGG
ACCGAGGGAC GCAAGAGCGT CGAAGAGGCG AAAAAGACCA TTGCGGAGAA GTTGCAGGCG
TATCTGGCCA CCAATCTGGC GGAAGAGGAG AAAAAACACC TCGAAGAGGC CAAGCCGCTG
ATCAAGGTCG CCGATGCCAC GCTGGAGCGG CTGGCATCGA TTCTGAGCGC CGAAGATGCC
GAAGCCCTCA CCGCTTTCAC GGTTTCCGAG CTCTACCCGG CCATCGATCC GGTGTCGGCT
AAGTTCAGCA GCCTGGTGGA CGACCAGTTG AAGATCGCCA AGCAAGAATA CGACCACAGC
TCCGGATTGT ACCGGGCAAG CCGGACCATC TCCTTGGTGG CGATCATCGT GGGAGTACTG
ATCGCCGGCA CCGCCGGCCT GCTCATTACC CGCTCCATTA CCGGCCCCCT GGCCGAAGGG
GTCGAGGTGG CCAACCGGCT TGCCGCCGGC GACCTGACCG TGGAGGTCCG GGCTGGCGGC
AGGGACGAGA CCGGCCAGCT CATGGCCGCC ATGGGGAACA TGGTGACCAG CCTGCGCCAC
CTGATCGCCG AGGCAATCAG CATCTCCCAC GGCATTGCCT CCGCCTCCAA CCAGCTCCAC
GCCACCTCCG AGCAGATCGC CACCGGTTCG GAGGAGGTTG CCAGCCAGGT GGGTGCCGTG
GCCACGGCCA GCGAGGAGAT GTCGTCCACC AGCCGGGACA TCGCCCAGAA CTGTACCCTG
GCCGCGGAAA GCTCCCGCGA AACCAGCGTC ACCGCGTCCA ACGGTTCCGC CGTCGTTCAG
GAAACCAACA GCGGGATGGT GGTCATTGCC GAGCGGGTCA AGCAGACCGC CGGCACCGTG
GATGCCCTGG GCCGCCGCTC CGAGCAGATC GGGGAGATCA TCGGCACCAT CGAGGACATC
GCCGACCAGA CCAACCTGCT GGCCCTCAAC GCCGCCATCG AGGCGGCCCG CGCCGGCGAG
CAGGGACGCG GCTTCGCCGT GGTGGCCGAC GAGGTACGGG CCCTGGCCGA ACGCACCACC
AAGGCCACCA AAGAGATCAG CGGCATGATC AAGGCCATTC AGAACGAGAC CAAGGCCGCC
GTCCAGGCCA TGGAAGAAGG GGTCGGCGAG GTTGAAAAGG GTTCCGTCAC CTCCCACAAG
TCGGGCCAGG CCCTGGCCGA GATCCTGGAC CGGATCAACG ACGTGACCAT GCAGATAAAC
CAGATCGCCA CCGCCGCCGA AGAGCAGACC GCCACCACCG GCGAGATCAC CTCCAACATC
CAGCAGATCT CCGACGTGGT CCAGCAGACC GCCCGGGGCG CCGAGGAGGT CTCCGCCGCC
GCGGCCCAGC TGGCCCAGCA GGCCCATCAG CTCCAGAACG TGGTGGGGAA CTTCAGGATC
GCATGA
 
Protein sequence
MMLQNLKIGT RLYGLIGFMS ILLIVIGALG LNTARTANNG LDTVYRDRVL PLKDLKIIAD 
MYAVNIVDVS HKVRNGNITW TEGRKSVEEA KKTIAEKLQA YLATNLAEEE KKHLEEAKPL
IKVADATLER LASILSAEDA EALTAFTVSE LYPAIDPVSA KFSSLVDDQL KIAKQEYDHS
SGLYRASRTI SLVAIIVGVL IAGTAGLLIT RSITGPLAEG VEVANRLAAG DLTVEVRAGG
RDETGQLMAA MGNMVTSLRH LIAEAISISH GIASASNQLH ATSEQIATGS EEVASQVGAV
ATASEEMSST SRDIAQNCTL AAESSRETSV TASNGSAVVQ ETNSGMVVIA ERVKQTAGTV
DALGRRSEQI GEIIGTIEDI ADQTNLLALN AAIEAARAGE QGRGFAVVAD EVRALAERTT
KATKEISGMI KAIQNETKAA VQAMEEGVGE VEKGSVTSHK SGQALAEILD RINDVTMQIN
QIATAAEEQT ATTGEITSNI QQISDVVQQT ARGAEEVSAA AAQLAQQAHQ LQNVVGNFRI
A