Gene GSU3037 details

Gene Information

Locus tag: GSU3037
Symbol: fliD
ID: 2686780
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sulfurreducens PCA
Kingdom: Bacteria
Replicon accession: NC_002939
Strand:
Start bp: 3342898
End bp: 3344307
Gene Length: 1410 bp
Protein Length: 469 aa
Translation table: 11
GC content: 58%
IMG OID: 637127730
Product: flagellar hook-associated protein 2
Protein accession: NP_954079
Protein GI: 39998128
COG category: [N] Cell motility
COG ID: [COG1345] Flagellar capping protein
TIGRFAM ID:
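
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length (the gene sequence in this record does end in TAG). The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is an untranslated stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
start_bp, end_bp = 3342898, 3344307

length = gene_length_bp(start_bp, end_bp)
print(length)                     # 1410
print(protein_length_aa(length))  # 469
```

Both results agree with the Gene Length (1410 bp) and Protein Length (469 aa) fields in the record.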


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCAAGTG TATCGTTTGG CGGATTGGCA ACGGGACTGG ACTCAAATAC GCTCATTTCC 
CAGCTTATGT ACCTGGAGCG CGCACCCGAG CGGATCCTGG AGAGCAAGAA GAGCACGATC
AGCAGTCAGA TCGATGTCTA TACTCAGGTC ACGAACTTGC TGAATTCGTT CAAGACCCTG
GCTGCCGGGA TGAATACCGC CACCGGTTTC ATGGGCAAGA CCACGTCAGT GGGGGACAGT
ACGGTCGCGA CGGCCACATC GTCCAGCATC GCATCGCCGG GCAGCTTTAA CCTGACGGTG
AACTCCCTGG CAAAGAACGA GCGGCAGGTT GTTGACCAGG GATATGCCTC GGCAGACGCG
CTCAATTTCA AGACCGGCAC CTTCACCATC AGCGGCGTGG CCACTCCCAT CACCATTGCC
GAAGGCCAGA ATTCCCTTCA GGGGATCGCC TCGGCCATCA ATGCTTCCGG TGCGAACGTG
ACCGCGTCGA TCATCAACGA CGGTTCGGCG AATCCCTACC GGCTTGTTAT CACCGGCAAG
GACACGAACA ATTACACCCT GAACTTCTCG GGGCTCACCG GCGATCCGGC CAGCGGCTCC
GCCTATACCA CGCCCACCAT CACCAAGTCG GGTCCGACCT ACCAGGCGGG AGCGGCTGCC
AGTTTCTCCG TAAACGGCAT CGCCATTACC AAGACCTCCA ACATCGTCAC CGACGTAATC
CCGGGCGTGA CTCTCACCCT GCTCAAGGAG GGGGGGGCAA CCACTACGGT GACCGTGGGG
AACGACACAT CCGGCGTCAC CAAGAAGATC AACGACTTTG TCGGCGCCTA CAACGCAGCC
ATGTCGCAGA TCAACAAGCA GTCCGAATAC AATGCGACCA CCAAGAAAGG AGGGGTCCTC
TCCGGGGACT CTACCCTCCG CAGCGTGAAG ACGCAGCTCC AGAACGTCCT GACCACTCCC
GTGGCAGGGA TAACCGGCAA ATACTCCACC CTGGCCGATA TCGGCATTAC TACCGACCGG
TCCAACGGCA CGCTCACCGT TGACGCGACC AAGCTTGCCG ATGCACTGGG CAGCAATTTC
AACGATGTGG TGGAGCTCTT TACGAAAAAC GGCGGCGTCT CCAACCTGGA CACGGAAAAA
TATGGCGTTG CCGAGCAATT CAGGAAGGTC ATCGATCGCT TTACCCATGC CTACGAGGGG
CCGTCCTCCA CTGCCAACGG CATCATTTCG AGTCGGGTGC GGGGGCTCAA CGACACCATC
AAGAGCATCG ACGATCAGAT CGATGCCATG GAAGTGAGGA TGGAGCGCAA GGAAGAGGCC
CTGAAGAAGC AGTTTACCGC CATGGAGACC CTTGTCAGCA GTCTTACAAC GCAGGGGAAT
TCGCTTATAA GTTACCTGTA CGGGTCGTAG
 
Protein sequence
MASVSFGGLA TGLDSNTLIS QLMYLERAPE RILESKKSTI SSQIDVYTQV TNLLNSFKTL 
AAGMNTATGF MGKTTSVGDS TVATATSSSI ASPGSFNLTV NSLAKNERQV VDQGYASADA
LNFKTGTFTI SGVATPITIA EGQNSLQGIA SAINASGANV TASIINDGSA NPYRLVITGK
DTNNYTLNFS GLTGDPASGS AYTTPTITKS GPTYQAGAAA SFSVNGIAIT KTSNIVTDVI
PGVTLTLLKE GGATTTVTVG NDTSGVTKKI NDFVGAYNAA MSQINKQSEY NATTKKGGVL
SGDSTLRSVK TQLQNVLTTP VAGITGKYST LADIGITTDR SNGTLTVDAT KLADALGSNF
NDVVELFTKN GGVSNLDTEK YGVAEQFRKV IDRFTHAYEG PSSTANGIIS SRVRGLNDTI
KSIDDQIDAM EVRMERKEEA LKKQFTAMET LVSSLTTQGN SLISYLYGS