Gene GM21_1418 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1418 
Symbol 
ID8136746 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp1667331 
End bp1668488 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content63% 
IMG OID644869032 
Productprotein of unknown function DUF185 
Protein accessionYP_003021235 
Protein GI253700046 
COG category[S] Function unknown 
COG ID[COG1565] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value4.54059e-28 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCCGATG GCGCAACTAC TAAACTTGCC GAAATCATCC TTAACCGCAT CCGCACCAGC 
GGGGACATAA CCTTTGCTTC CTTCATGGAC GCGGCTCTCT ATGAGCCGGA CCTTGGCTAT
TACACTTCCG CGGGCCGCAA AGTCGGAGCT GAGGGGGACT TCTACACCAG CATGAACGTC
CACAGCGCTT TCGGACGGCT CATCGCTCAG GAGATCTGCC GGTTCTGGGA AGTCCTCGAC
TCCCCCGCCT CTTTCACCAT CGCCGAGGCA GGCGCGGGCG GCGGACAACT GGCGCAGGAC
ATTCTGGACG CCATCAGCGA AGACAACCCC GCTTTCTACA GCGGCCTCAC CTATCGCCTC
ATTGAGAAGG AACCTTCCCT GCAGCAGGCC CAGGCCGCGC GCCTGTCGCG CCACGCGGAC
CGGCTCGCCT GGAGCTCGCC GGACGAACTC GCAGCGGGGA CGCTCTCCTT CACCGGCTGC
ATCATCTCCA ACGAACTTTT CGACGCCATG CCGGTACACA TCGTGGAACT GACCGAGGCG
GGACTGCGGG AGGTGTACGT ATCCGCCGAT GACAACGGCT TCGTCGAAAG GCTGCTCCCC
CCGTCCACCC CGGAGTTGGA GCAGTACCTG CGCAAGTACG AGGTAAGGCT CCTGCCGGGC
CAGCGCGCGG AGATCAACCT CGCCGCTTCC GGCTGGATCG CACAGGCGGC AGCCACCCTC
ACCCGCGGCT TCGTGCTCAC CATCGACTAC GGCTACCTCT CCGGGGAGCT CTACACGCCG
CAAAGGAAAA ACGGGACGCT TCTCTGCTAC TACAAGCACT CCACCAACGA AAACCCCTAC
CAACTGGTGG GTGAGCAGGA CATCACCACC CACATCAATT TCAGCCAGCT CATCGTCGAC
GGCGAGGAGT CGGGGCTCAA GAAGGCGTGG TACGGCGAGC AGTACCGCTT CCTCCTGGCC
GCGGGGCTCA TGGAGGAGCT GATCAGGCTG GAGGCGCAGG CCAAAGACGA GCAGGAAAGC
CTCAAGCACC GACTGGCGCT CAAGAAGCTG ATGCTTCCCG AGGGGGGGAT GGGCGACACC
TTCAAGGTGC TGATCCAGTC CAAGGGGGTC GACAACCCGC AGCTTCTCTG CATGAGGAAA
TGGGGGATGG GGCTGTGA
 
Protein sequence
MADGATTKLA EIILNRIRTS GDITFASFMD AALYEPDLGY YTSAGRKVGA EGDFYTSMNV 
HSAFGRLIAQ EICRFWEVLD SPASFTIAEA GAGGGQLAQD ILDAISEDNP AFYSGLTYRL
IEKEPSLQQA QAARLSRHAD RLAWSSPDEL AAGTLSFTGC IISNELFDAM PVHIVELTEA
GLREVYVSAD DNGFVERLLP PSTPELEQYL RKYEVRLLPG QRAEINLAAS GWIAQAAATL
TRGFVLTIDY GYLSGELYTP QRKNGTLLCY YKHSTNENPY QLVGEQDITT HINFSQLIVD
GEESGLKKAW YGEQYRFLLA AGLMEELIRL EAQAKDEQES LKHRLALKKL MLPEGGMGDT
FKVLIQSKGV DNPQLLCMRK WGMGL