Gene GSU0419 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU0419 
SymbolflgE 
ID2686295 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp449476 
End bp450735 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content63% 
IMG OID637125084 
Productflagellar hook protein FlgE 
Protein accessionNP_951478 
Protein GI39995527 
COG category[N] Cell motility 
COG ID[COG1749] Flagellar hook protein FlgE 
TIGRFAM ID[TIGR03506] fagellar hook-basal body proteins 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.809638 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGTAA CATCCGCACT GTACACCGGC ATCAGCGGTC TCAACGCCAA CGGCGAGGCC 
ATGTCCGTCA TCGGCAACAA CATTTCCAAC GTCAACACCA TCGGCTTTAA GCAGGGCCGG
ATGCTCTTCT CGGACGTCCT CTCCAGCACC ATCAGCGGCG GGTCCCAGAT CGGCCGCGGC
GTCCAGATCC AGACAGTGGA GAATCAGTTC ACCCAGGGCT CCTTCGAGAG CACCGAGAGC
GGTACCGACC TGGCCATCCA GGGCGATTCC TTCTTCGTGG TCCAGAACAC CAGCGGCCGC
TACTATACCC GCGCCGGCGC CTTCTCCTTC AATAAGGACA AGACCCTGGT GAATCCGGAG
GGATATCAGG TCATGGGGTA CGGCATCATT CCCTCGTCGG GACTTTCCGA CGGCGTGCTC
AAGCCCATCG ATCTGACCAA CTTTGCCACC ACTCCGCCGA AGCAGACTTC CACCGTCAAG
TTCGTGGTGA ACCTGGACTC CACCCAGACC ACGCCGACCC TGGCGTGGGA CCCCGCAAAC
CCGGTTGCCA CGTCCAACTA CTCGACCAGC CTGTCGGTCT ACGATTCCCA GGGCAATGCC
CACACCGCCA CGGTGTATTT CCGCAAGACC GCCGACAACG CATGGGACTG GCACGTCATC
CTCCCCGATG CCGCGGCAGG CACGCCGGGC AGCACCACTA CCCCCATCGA CGGGACCCTC
ACCTTCGATG CCACCGGAGC CCTCACCGCC CAGACTCCCC TGGCCGGCGC GGCCCAGAAC
ATCACCTTCG CGGGCGGCGT CACCGCACCC CAGCCGATCT TCTTCGACCT GGGAGTCGGC
GCTACCACCC AGTACGCCAG CTCGTCGGTG GTTTCTTCCC AGACCCAGGA CGGCTACTAC
CAGGGCACCC TCACCAAGGT AACCATCGAT GACAAGGGAT ACGTGAACGG CGTGTACTCC
AACGGCCAGC TTCAGAAGCT CTACCAGGTG GCCCTGGCCA AGTTCTCCTC CACGGCCGGC
CTGTCCAAGG CGGGTGGCAC CCTCTTCGAG GAGACCCTCG AGTCGGGACA GCCCCTGTTC
TCCGACGCCA GCACCCCCGG CGTCGGCAAG ATCCTCGCCA ACTCCCTGGA GCAGTCCAAC
GTTGACATGG CGGCCCAGTT CGTCAAAATG ATCACCACCC AGCGTGGCTA CTCCGCCAAC
TCCAAGACGA TCACCACGGC CGACGAGATG CTGCAGGAAG TGCTCAGTCT CAAGCGGTAA
 
Protein sequence
MSVTSALYTG ISGLNANGEA MSVIGNNISN VNTIGFKQGR MLFSDVLSST ISGGSQIGRG 
VQIQTVENQF TQGSFESTES GTDLAIQGDS FFVVQNTSGR YYTRAGAFSF NKDKTLVNPE
GYQVMGYGII PSSGLSDGVL KPIDLTNFAT TPPKQTSTVK FVVNLDSTQT TPTLAWDPAN
PVATSNYSTS LSVYDSQGNA HTATVYFRKT ADNAWDWHVI LPDAAAGTPG STTTPIDGTL
TFDATGALTA QTPLAGAAQN ITFAGGVTAP QPIFFDLGVG ATTQYASSSV VSSQTQDGYY
QGTLTKVTID DKGYVNGVYS NGQLQKLYQV ALAKFSSTAG LSKAGGTLFE ETLESGQPLF
SDASTPGVGK ILANSLEQSN VDMAAQFVKM ITTQRGYSAN SKTITTADEM LQEVLSLKR