Gene Glov_3220 details

Gene Information

Locus tag: Glov_3220
Symbol:
ID: 6368748
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter lovleyi SZ
Kingdom: Bacteria
Replicon accession: NC_010814
Strand:
Start bp: 3444180
End bp: 3445832
Gene Length: 1653 bp
Protein Length: 550 aa
Translation table: 11
GC content: 57%
IMG OID: 642678637
Product: General secretory system II protein E domain protein
Protein accession: YP_001953446
Protein GI: 189426269
COG category: [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB
TIGRFAM ID:
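
The listed gene and protein lengths follow from the start and end coordinates. The sketch below reproduces that arithmetic, assuming the coordinates are 1-based and inclusive (an assumption, though it is consistent with the values in this record). The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residues encoded by a complete CDS: codon count minus the stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length = gene_length_bp(3444180, 3445832)
print(length, protein_length_aa(length))  # 1653 550
```

Both results agree with the Gene Length and Protein Length fields of this record.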


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCGATCA ATGTCAGAGA AGGTACGCTG GGTGCCATCC TCTACAACTC ACGTATCATC 
AGTGAAGCCG ACATCACTGC CGCCCTTGAA GAGCAGCAGC GCAGCAGCAG CCGCTTCGGT
GAAGCCCTGG TATCGCTGGG AATTGTCACC CAGGAGGATA TCGACTGGGC ACTCTCCAAT
CAGCTGGACA TCCCCTATAT CCGTCTCAAG CAGGAGATGA TTGACCCGGA GGCGTTGAGC
CTGCTGCCGC CGCACCTCTG CCGGATACAC CAGCTGATCC CGTTGATCCG GGCTGGAGAT
GAACTTTCAA TTGCCATTGC CGATCCGCTC AACAAAGAAG CAGTGACCGC GGCAGCAGAG
GCTTCAGGCT GCCGGATCAA CCTCTCGGTG GCCCTGATCC GGGAGATCAA TGAGATGCTT
GACCTCTGTT ATGGTCTACC CAGGGAAGAT CTGCTGGGGT TCAGCTCCGG CCTTCTATCA
CCGGAACAGC TCAGCTCCAT CAATGCCGAC AGCAGCGGCC AGCAACTGAT CAACAGTCTG
CTGGCCTACA GCATCCAGCA CCAGCTGACC TCCTTTTCCT TCAGACCGCT TGAGGATCTG
ATCAGCATCT CCGGCCGCAG CGGGGCAACC AGCCATGAGC TGGGACAGTT GACGCATGAA
CACTATGCCG GGCTGTGCAC ACTGCTCAGA AAGGCTGCCG GGCTGTTGCC ATCGGGCGAG
ACATCACAGA GCGGTTGTCT CTTTTTTCAG TACCATGAAC GGGAGATCTG CTTTCAGGTA
CTGCTGCTGC AAGGGGCAAA CGGTGACTAC CTCACCATCC GCCAGCATAT CAGCGCCACC
ATTCCGGCAA AATTGGACCT GCTGCAACTG CCCGACTTTC AGAAACAGCA GTTCAGACGA
CTGGCAACCC GACGTCAGGG AATGATCCTG TTTGCCTCCC GCTCGTTGCA GGAACGTTGC
CGTTTCATGG ACCTGATGCT GGAAGAGCTG GACACCGACG GACAATCGGT GCTGATTCTG
GGCAAGGAGC CGGGCCGCAT GAACAAGCGC TTCCCCCGCA TCCCCCTTGC CGGTTCGGAA
GCGGCCAGGG GACGTCTGAT CATGGACAGC CTGGAACATG GCCCGGACAT TCTGGTGATT
GAGGATGGTA CGGCACTGGA GTCATTCACG GCAGCCGGCC GGGCCGCCAT GCGGGGCAAA
CTGGTGCTGG TGGGAATGGA TATCCGCGGC ACCCGCAACC TGTGCGACCA CCTGATCCGC
TTTCGTCAGC GCAACGCCTT TCTGACCCCG TTTCTTTCCG GTATTGTATC ATTCAAAGGC
ATTCAGCTAC TCTGTCCCTC CTGCAGACAG GCCGCCACCA TAGCGCAACA GGAACTACTC
GGCCTGCAGA TGCAGCCACC ACCTGCAGAG CTGTACCATG CAACAGGCTG TCCACAATGT
AACTACACCG GGGTCAGTGA ACGGATTTTT CTGACCAACT GCATCTGTTT TGACCGTGAA
CTGCACGCCC GTTTTGATGC CGCTTCTGAC GGAAGCAGTT TTATCGCCGG ACTTAATGCT
GATGGGTATC GCGGCATCAA GGCGGAAGGC GAGGTTCTCT TGAAGGCGGG AGCTGTATCA
CCGGAAGAGT TCATTGCTGC TGTCATTCAA TAA
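
The gene sequence is printed in space-separated blocks of ten bases, so it needs cleaning before any calculation. A minimal sketch of how the blocks could be joined and the GC content checked against the GC content field is shown below. The helper names are hypothetical, and the example uses only the first two blocks of the sequence rather than the full gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(base in "GC" for base in seq)
    return 100.0 * gc / len(seq)


# First two blocks of the gene sequence, used here only as a small example.
first_blocks = clean_sequence("ATGCCGATCA ATGTCAGAGA")
print(len(first_blocks), gc_percent(first_blocks))  # 20 45.0
```

Passing the whole gene sequence through the same two functions gives a value that can be compared with the GC content field in the record.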
 
Protein sequence
MPINVREGTL GAILYNSRII SEADITAALE EQQRSSSRFG EALVSLGIVT QEDIDWALSN 
QLDIPYIRLK QEMIDPEALS LLPPHLCRIH QLIPLIRAGD ELSIAIADPL NKEAVTAAAE
ASGCRINLSV ALIREINEML DLCYGLPRED LLGFSSGLLS PEQLSSINAD SSGQQLINSL
LAYSIQHQLT SFSFRPLEDL ISISGRSGAT SHELGQLTHE HYAGLCTLLR KAAGLLPSGE
TSQSGCLFFQ YHEREICFQV LLLQGANGDY LTIRQHISAT IPAKLDLLQL PDFQKQQFRR
LATRRQGMIL FASRSLQERC RFMDLMLEEL DTDGQSVLIL GKEPGRMNKR FPRIPLAGSE
AARGRLIMDS LEHGPDILVI EDGTALESFT AAGRAAMRGK LVLVGMDIRG TRNLCDHLIR
FRQRNAFLTP FLSGIVSFKG IQLLCPSCRQ AATIAQQELL GLQMQPPPAE LYHATGCPQC
NYTGVSERIF LTNCICFDRE LHARFDAASD GSSFIAGLNA DGYRGIKAEG EVLLKAGAVS
PEEFIAAVIQ
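
The protein sequence can be rederived from the gene sequence by codon translation. The sketch below builds a codon table in the conventional TCAG ordering and translates up to the first stop codon. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as starts. Alternative start codons are not handled here, which is an acceptable simplification for this gene because it begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
print(translate("ATGCCGATCA ATGTCAGAGA AGGTACGCTG"))  # MPINVREGTL
```

The output matches the first ten residues of the protein sequence above, and the final codons of the gene (GTC ATT CAA TAA) translate to the closing residues VIQ followed by a stop.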