Gene GM21_2015 details

Gene Information

Locus tag: GM21_2015
Symbol:
ID: 8137349
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 2337474
End bp: 2338538
Gene Length: 1065 bp
Protein Length: 354 aa
Translation table: 11
GC content: 63%
IMG OID: 644869628
Product: protein of unknown function UPF0118
Protein accession: YP_003021825
Protein GI: 253700636
COG category: [R] General function prediction only
COG ID: [COG0628] Predicted permease
TIGRFAM ID:
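
The start and end coordinates, gene length, and protein length listed above are consistent with each other. The minimal sketch below shows the arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 2337474
end_bp = 2338538

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the result matches the listed 1065 bp and 354 aa.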


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value: 0.00000000000281213
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGATAAAA AACTCTACCT AAGCCTCATC GCTGCCTTTT TCACCGTTGC AGCGATAGCG 
GCGATAGTCC TCTTTGCCGC TCCCATATTG AAGCCGCTGG CCTGGGCTCT CATCATCGGC
ATCGCCACCA TGCCGCATTA CCAGCGCATC CTGAACCGCT TCCCCGATCG GCCGGGGCGC
GCCTCGGGCC TCATGCTGCT CGCCGTGGCC GTCTGCCTGG TGCTGCCGGC GTCGTGGCTG
GTGATCACCG CAGCCGTCAA CGCTCCTGAG TGGTACCGGC AACTGGAGCA GATGATCCAG
GAGGTCACCA GGACCAGTTC CGGCGCCCTC AGCCAGATTC CCTACTACGA CCGGATTATG
TCCCTCGTGG AGCGCTTCGG CATCGATCTG GGCAATATCG GCGGCAAGAT CGCCTCCAGC
GGCTCGACCG TGATACTGAA CGCCGCGACC AACATGGTGC GCAACCTGTT CGACTTCATC
TTCACGCTGC TGGTGGCCCT GTTCCTGCTC TTTTTCATCT ACCGCGACGG CGAGCGGGCC
GTCGCGCTTT GCATCGGCAA ACTGGCACCC AACCCGCGCA AGGCCCAGCA CTACGCGACC
CAGATCCGCT CCATCACCAC GGCCGTCGCC GTCGGTACCA TACTGACCTG CTGCACCCAA
GGTGTCATCG CTGGACTCGG ATACTGGGTT GCAGGAGTCC CGGCCCCGGT TTTCTTCGCG
GCGCTGACCG CCATCGCCGC CCTGATACCC GTTGTCGGCA CCGCCATCAT CTGGGTCCCC
ATAGTTGCCC TGACCGCTGT AACCGGCTCC TACCTCACCG CTCTCCTTCT GGCGCTTTGG
TGCGTCTTTT TCGTCGGCTT CTCGGACAAC GCCATACGTC CGCTTGCCAT AGGCGCGGCC
AGCGACATCT CGGTGCTGGC TGTGGTCACC GGCGCCCTTT GCGGCGTCGT CATGATGGGG
CTTCTGGGCC TGATCATCGG GCCGGTGATC TTCGCCGTAC TGTTCAGCAT GTGGGACGAC
GCGGTAAGCG CAGCGGGAGA CACCGAGTAC AACGATGTCC CCTGA
 
Protein sequence
MDKKLYLSLI AAFFTVAAIA AIVLFAAPIL KPLAWALIIG IATMPHYQRI LNRFPDRPGR 
ASGLMLLAVA VCLVLPASWL VITAAVNAPE WYRQLEQMIQ EVTRTSSGAL SQIPYYDRIM
SLVERFGIDL GNIGGKIASS GSTVILNAAT NMVRNLFDFI FTLLVALFLL FFIYRDGERA
VALCIGKLAP NPRKAQHYAT QIRSITTAVA VGTILTCCTQ GVIAGLGYWV AGVPAPVFFA
ALTAIAALIP VVGTAIIWVP IVALTAVTGS YLTALLLALW CVFFVGFSDN AIRPLAIGAA
SDISVLAVVT GALCGVVMMG LLGLIIGPVI FAVLFSMWDD AVSAAGDTEY NDVP
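
The sequences above are displayed in blocks of 10 letters, so they need to be joined before any length or composition check. The sketch below is one hedged way to do that: it strips the block spacing, computes a GC fraction, and runs a basic sanity check on a coding sequence. The function names are hypothetical helpers, not part of any database tool, and the sample joins only the first and last displayed lines of the gene sequence to keep the example short, so it is not the full gene.

```python
# Stop codons of translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks used to display the sequence in blocks."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """Check length is a multiple of 3, starts with ATG, and ends in a stop codon."""
    return len(seq) % 3 == 0 and seq.startswith("ATG") and seq[-3:] in STOP_CODONS


# Illustrative sample only: first and last displayed lines of the gene sequence.
first_line = "ATGGATAAAA AACTCTACCT AAGCCTCATC GCTGCCTTTT TCACCGTTGC AGCGATAGCG"
last_line = "GCGGTAAGCG CAGCGGGAGA CACCGAGTAC AACGATGTCC CCTGA"
sample = join_blocks(first_line + "\n" + last_line)

print(len(sample), looks_like_complete_cds(sample))
print(f"GC fraction of sample: {gc_fraction(sample):.2f}")
```

Passing the complete gene sequence through `join_blocks` and `gc_fraction` would allow the listed gene length and GC content to be rechecked in the same way.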