Gene GM21_3833 details


Gene Information

Locus tag: GM21_3833
Symbol:
ID: 8139207
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 4418001
End bp: 4420283
Gene Length: 2283 bp
Protein Length: 760 aa
Translation table: 11
GC content: 70%
IMG OID: 644871450
Product: Tetratricopeptide TPR_2 repeat protein
Protein accession: YP_003023608
Protein GI: 253702419
COG category: [R] General function prediction only
COG ID: [COG0457] FOG: TPR repeat
TIGRFAM ID:
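
The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4418001
end_bp = 4420283
gene_length_bp = 2283
protein_length_aa = 760

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and adds no residue.
codons = span_bp // 3
expected_protein_aa = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("span is a whole number of codons:", span_bp % 3 == 0)
print("codons minus stop matches protein length:",
      expected_protein_aa == protein_length_aa)
```

Under these assumptions all three checks agree with the listed values: 2283 bp is 761 codons, and 761 minus the stop codon gives 760 aa.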


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 52
Fosmid unclonability p-value: 0.00358812
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCCTGCTA GCTCCCGCGC GGATAGCCCG GCCGAGACGT TCGCCGCGGC CCTCGATCTG 
CAGAAAAGCG GCCACAGGGG CGAGGCCGAG CGGCTCTACC GCGCCCTGGC GGCATCGGGG
GGGGAGTTGG CCGCCGACGC CTGCATCAAT CTGGGGGCGC TCCTGGACGA GAGCGGGCGC
GCCGAGGAGG CGCTGGAAAA GTACCGCGAG GCGCTCGCCC TGCGGGAGGG GGACCCCCTC
GCCCTAAACA ACGCCGGTTC CACGCTGTTC AAGCTGGGGC GCTTCACGGA GGCGGCGCAA
CTATTCCGCC ATGCCCTGGA GCGGGCCCCG GATTCCCTGG AGGCGCAGGT GGCGCTCGGC
GCGGCGCTGC AAAGGGACGG GGATCTCCCC GCGGCGCTCG CCGTCTTCCG CGATCTGGTG
GCGCGGCGCC CGGATTGCGC CGAAGCGCAC TGGAACCTGG CGCTGGCCCT CCTCCTGGCG
GGTGAGTTCC GCGAGGGGTG GCAGGAGTAC CAGTGGCGCT GGCGCAGGGA CTCCTTCACT
TCGCCCCGGC GCGAGCTTGC GGCGCCCGCC TGGGACGGCA CCCCTCTTCA AGGGCGCCGC
ATCCTGGTGC ACGGCGAACA GGGGCTGGGC GACACCATCC AGTTCGCCCG CTACCTCCCC
ATGGTCGCCG CCGCGGGAGG GGTGGTGGTG GCGGAATGCC AGTCCCCCTC CCTGGTGCCG
CTCTTGCGCT CCATCCCCGG CGTCGCCGAG ACCTGCGTCA TGGGGGAAAC GCTTCCCCCC
TACGACCTCG AGGTCGCGCT CCTGTCGCTC CCCCACCTGT TCGGCACCAC CCTGGAGAAC
GTTCCAAGCG GGGTCCCCTA CCTGGCGCCC CCACAGGACC GGATCGCCCC CTGGCGGGAG
AAGGTGGCGG CGGACCTGGG GTTCAAGGTG GGGCTGGTCT GGGCCGGGAA GCCGGTTCCG
GACCCATTTC GCTCCTGCAC GCTCGCGGCG CTCTCGCCTC TCTTCGACAT CCCCGGGGTG
AGCTTCTATT CGCTCCAGGT GGGTGAGGAG GCGCAACAGG CAAAGGAATT TCCCTCCCTC
ATCGATTTCA CCCCCGGCAT CGCGGACTTC GGCGACACGG CCGCTCTCAT CGCGCAGCTC
GACCTGGTCC TCTCCATCGA CACCTCCGTG GCCCACCTGG CAGGCGCACT GGCGAAGCCG
GTCTGGCTGC TGCTCCCCAA GGCGGGCGAC TACCGCTGGC TCACCGAGCG CGAAGATTCC
CCCTGGTACC CGACCATGCG CCTTTTCCGG CAGAAGCTGC AGGGAGAGTG GGGGGAGGTG
GTCGAACGCG TGAAGGAGGA ACTGGAGCCG TCGGCCTGGG GCTTTTTGGA AAAAGCTGCC
GCGGCGCAGC CGTTCAACGG CCGCAGACAC TACCTCTGCG GGCTCTTCCT CTCCTTCGAA
AAGAGGGAGC GCGAGGCGAC GGTAAGGTAC AGCAAGGCGG CGCAGTTGAT GCCCGGAAGC
TGGGAGCCGC ACTACGCGCT CGCCTGCTCG CTGCAGCAGC TTACGCGACT TGCCGAGGCG
AAGGAGAGCC TTGTGGCGGC GCTCGTCTTG GAGCCGCGCC TTCCCCTCTT GCACGAGGCT
TTCGGCATCC TGTGCCAGAT GCAGGACGAC CCCGAGGGGG CGGCGCGCGC CTACCGGGAG
GCGCTGGCGC TGGACCCGGA CGCGGTCAAG GCACGCTACA ACCTGGCCAC GCTCTGCAAG
GAGAAAGGGC TCGCAGCCGA GGCTCTGCAA GGTTTCCGCG AGGTGGTGCG GCGCGAGCCG
GAGCATGCCG ACGCGCATTG GAACCTGGCC GTGATGCTCC TCATGACCGG GGAGTTCGCC
GAAGGTTGGC GGGAGTTTCC CTGGCGCTTC AAAAAGAGCC TCTCTCCCCC GGTGCGCCGC
TGGGAGGAGC TGCCGCGCTG GGGACGGCTC CCCGCTTGCC GGTGCGACCG TCCTGCTCTA
CGGGGAGCAG GGGGCCGGCG ACACGCTGCA GTTCGTGCGC TACGCCCCGC TGGTGGCAAA
GCGCGGCGGA CGCGTGCTCA TCGAGGTGCA GTCGCGGGGG CTCGTCGAGC TGGTGGCGAC
CGTCGCTGGC GTCAGCGGCG TCTTCGCCTG CGGCGACCCC CTCCCCGCGT TCGAGTGGCA
GGCCTCGCTG ATGGATCTTC CCGGCATCTT CGGCACCGAG CCCGGCACCA TCCCGGCCGC
CATCCCCTAT CTCGTGGTCG ACCCCGGGCG CCGCGACTCG CTGCGCCGTC TCTTCGAGGC
TGA
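
The GC content field can be recomputed from a sequence laid out in space-separated blocks of ten bases, as it is here. The sketch below is one way to do that, assuming the value is the fraction of G and C among the unambiguous bases. The helper name `gc_content` is hypothetical, and only the first line of the gene sequence is used to keep the example short.

```python
def gc_content(seq: str) -> float:
    """Return the fraction of G and C among the A/C/G/T characters in seq.

    Spaces, newlines and any other characters are ignored, so a sequence
    can be pasted in the blocked layout used by this record.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T characters")
    gc = sum(1 for b in bases if b in "GC")
    return gc / len(bases)


# First line of the gene sequence, copied from this record.
first_line = "ATGCCTGCTA GCTCCCGCGC GGATAGCCCG GCCGAGACGT TCGCCGCGGC CCTCGATCTG"

print(f"GC fraction of the first 60 bases: {gc_content(first_line):.3f}")
```

Passing the full 2283 bp sequence to the same function gives the gene-wide value to compare with the GC content field.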
 
Protein sequence
MPASSRADSP AETFAAALDL QKSGHRGEAE RLYRALAASG GELAADACIN LGALLDESGR 
AEEALEKYRE ALALREGDPL ALNNAGSTLF KLGRFTEAAQ LFRHALERAP DSLEAQVALG
AALQRDGDLP AALAVFRDLV ARRPDCAEAH WNLALALLLA GEFREGWQEY QWRWRRDSFT
SPRRELAAPA WDGTPLQGRR ILVHGEQGLG DTIQFARYLP MVAAAGGVVV AECQSPSLVP
LLRSIPGVAE TCVMGETLPP YDLEVALLSL PHLFGTTLEN VPSGVPYLAP PQDRIAPWRE
KVAADLGFKV GLVWAGKPVP DPFRSCTLAA LSPLFDIPGV SFYSLQVGEE AQQAKEFPSL
IDFTPGIADF GDTAALIAQL DLVLSIDTSV AHLAGALAKP VWLLLPKAGD YRWLTEREDS
PWYPTMRLFR QKLQGEWGEV VERVKEELEP SAWGFLEKAA AAQPFNGRRH YLCGLFLSFE
KREREATVRY SKAAQLMPGS WEPHYALACS LQQLTRLAEA KESLVAALVL EPRLPLLHEA
FGILCQMQDD PEGAARAYRE ALALDPDAVK ARYNLATLCK EKGLAAEALQ GFREVVRREP
EHADAHWNLA VMLLMTGEFA EGWREFPWRF KKSLSPPVRR WEELPRWGRL PACRCDRPAL
RGAGGRRHAA VRALRPAGGK ARRTRAHRGA VAGARRAGGD RRWRQRRLRL RRPPPRVRVA
GLADGSSRHL RHRARHHPGR HPLSRGRPRA PRLAAPSLRG
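
The protein sequence relates to the gene sequence through the listed translation table. The sketch below translates the first 30 bases of the gene and compares the result with the first ten residues of the protein. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the two differ in which start codons are permitted). The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon.

    Non-ACGT characters such as the spaces between blocks are dropped first.
    """
    dna = "".join(b for b in seq.upper() if b in "ACGT")
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence and first block of the protein,
# both copied from this record.
gene_start = "ATGCCTGCTA GCTCCCGCGC GGATAGCCCG"
protein_start = "MPASSRADSP"

print(translate(gene_start) == protein_start)
```

The same function can be applied to the full gene sequence to compare against all 760 residues.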