Gene GM21_0553 details


Gene Information

Locus tag: GM21_0553
Symbol:
ID: 8135864
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 678066
End bp: 679376
Gene Length: 1311 bp
Protein Length: 436 aa
Translation table: 11
GC content: 64%
IMG OID: 644868166
Product: Xanthine/uracil/vitamin C permease
Protein accession: YP_003020385
Protein GI: 253699196
COG category: [R] General function prediction only
COG ID: [COG2252] Permeases
TIGRFAM ID:
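
The Start bp, End bp, Gene Length, and Protein Length values above are related by simple arithmetic. The sketch below shows that relationship, assuming 1-based inclusive coordinates and a CDS that ends in a single stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS: codon count minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(678066, 679376)
print(length)                     # 1311
print(protein_length_aa(length))  # 436
```

With the coordinates listed for GM21_0553, the sketch gives 1311 bp and 436 aa, matching the Gene Length and Protein Length fields.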


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value: 0.000000333886
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGACCTCCC GCATAGCAGC ATACTTCCAG TTCCAGCGCT ACGGCACCGA CATGAAGCGT 
GAGGTCATCG CCGGGCTCAC CACCTTCCTC ACCATGGCGT ACATCATCAT AGTCAACCCG
GCCATCCTGG AAAACGCAGG CATCCCCCGC GGCCCCTCCA CCACCGCGAC CATCATCGCC
GCCGTCTTCG GCACGGTGCT CATGGCTTTC TTCGCCAACC GCCCCTTCGC CATCGCCCCC
TACATGAGCG AAAACGCCTT CATCGCCTTC GTGGTGGTGA AGGTGATGGG GTATTCCTGG
CAGACCGCGC TGACCGCCGT CTTCTTCGCC GGCATCCTCT TCACCCTGCT GACCCTGTTC
AAGGTGCGGA GCTGGCTCGC CGAATCGATC CCGCTGTCGC TCAAGTGCGC CTTCGCCGCC
GGCATCGGAC TCTTCCTCAC CTTCATCGGC CTGAACGAGA CCGGCATCGT GGTGCTGGGT
GTTCCCGGCG CGCCGGTGAA GCTTGGCGAT CTCTCCCAGC CCTCGGTGCT CCTGGCCGTC
TGCGGCCTGC TCCTGACCGT GGTGCTGATC TCCCGCAAGG TGCTGGGGGC CATGATGATC
GGCATCGTCG CCACCACGCT CGCCTCGATC GCGCTCAAGG TGACCCCGCT GCCGCACTCC
TTCGTCAGCC TGCCGCCGGA CATCTCCCCG ATCCTGTTCC AGCTCGATTT CGCTGGGGCG
CTCTCGCCGG GGTTCTTCCC GATCATCCTG ACCATCTTCA TCATGGCCTT CCTCGACACC
GTCGGGACGC TGCTGGGCCT TTCCATGCGG GCCGACCTTC TGGACGAGAA GGGAAACCTT
CCCGAGATAG AGAAGCCGAT GCTCGCCGAC GCGCTGGCGA CGGTCGCGGC GCCGCTTCTG
GGGACCACCA CCACCGGCGC CTACATCGAG AGTGCGGCAG GGATCGAGGA GGGGGGGCGC
ACCGGCTTCA CCGCGCTGGT CACCGCCTTC TTCTTCCTTC TGGCCCTCTT CTTCGCGCCC
CTTTTCACCG TGGTCCCGGC GCACGCCTAC GGCATAGCCC TGATCGTGAT CGGCTCCTTC
ATGATCAGCC CGCTGGCGAA GATCGACTTC GACGACTTCA CCGAACAGAT CCCCGCCTTC
CTGACCGTGG TGCTGATGAT CTTCACCTAC AACATCGGGG TCGGCATGAC CGCCGGTTTC
ATCGCCTATC CGCTCATGAA GGGCGCCACG GGACGCATCA AGGAGATGAA AGGGGGGATG
TGGGTGCTCG CCCTGCTCTC CCTCTCCTAC TTCATCTTCG GGGCGAAGTA G
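
The gene sequence is displayed in space-separated blocks of 10 bases, so the whitespace has to be stripped before any calculation. As a minimal sketch, assuming GC content is defined as the share of G and C bases among all bases, a value such as the GC content field could be recomputed as follows. The function name `gc_percent` is hypothetical.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First two blocks of the gene sequence: 12 of 20 bases are G or C.
print(gc_percent("ATGACCTCCC GCATAGCAGC"))  # 60.0
```

Passing the full gene sequence as a single string, line breaks included, works the same way.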
 
Protein sequence
MTSRIAAYFQ FQRYGTDMKR EVIAGLTTFL TMAYIIIVNP AILENAGIPR GPSTTATIIA 
AVFGTVLMAF FANRPFAIAP YMSENAFIAF VVVKVMGYSW QTALTAVFFA GILFTLLTLF
KVRSWLAESI PLSLKCAFAA GIGLFLTFIG LNETGIVVLG VPGAPVKLGD LSQPSVLLAV
CGLLLTVVLI SRKVLGAMMI GIVATTLASI ALKVTPLPHS FVSLPPDISP ILFQLDFAGA
LSPGFFPIIL TIFIMAFLDT VGTLLGLSMR ADLLDEKGNL PEIEKPMLAD ALATVAAPLL
GTTTTGAYIE SAAGIEEGGR TGFTALVTAF FFLLALFFAP LFTVVPAHAY GIALIVIGSF
MISPLAKIDF DDFTEQIPAF LTVVLMIFTY NIGVGMTAGF IAYPLMKGAT GRIKEMKGGM
WVLALLSLSY FIFGAK