Gene GM21_1799 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1799 
Symbol 
ID8137130 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp2092141 
End bp2093301 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content66% 
IMG OID644869411 
Productprotein of unknown function DUF819 
Protein accessionYP_003021611 
Protein GI253700422 
COG category[S] Function unknown 
COG ID[COG5505] Predicted integral membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones63 
Fosmid unclonability p-value0.0833074 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCACTA ACCCGGTTCT CATCGTAGCC GTCTTGATCT CCATCGAGGC CCTGGTCCTT 
TGGCTGTCGC GGCACGAGCG GACCAGGCGC TTCTTCAACC TCCTCCCCTC GGTCTTCTGG
ATCTATTTCC TACCCATGCT GGCGGCGACC TTCGGGCTCA TCGCCACCAA TAGCCCGGTC
TACGGACTCA TCACCAGCTG GCTCCTCCCC GCAAGTCTCG TGCTGCTCCT TTTGCCGGTC
GACATCAAGG CGATCCTGAG GCTCGGCCCC ACGGCGATCG CCATGTTCTT CATCGGCGCG
GCCGGGATAA TCGCCGGCGC CGCCCTCTCC TTCTCCCTTT TCAAGCCGGT GATCGGAGCC
CGGTTCTGGT CCGGCTTCGG GGCGGTCTCC GCTTCCTGGA CCGGCGGCAG CGCCAACATG
ATAGCGGTGA AGGAGGCGCT TTCGGTCCCG GACGAGGTCT TCGCGCCCAT GGTGATCGTG
GACACGGTGG TCCCCTACCT CTGGATGGGT TTCATGATCG CCATCGTCGG GGTGCAGCCG
GCCTTCGACC GCTGGAACCG CTCGAACCGA GCCACGCTGG ACCACCTCGG GGAAGAGGCC
GTGCGCTACC TGGCCACCGC CGGCGGCCGC CGCACCTTGA GCGGGATAGC GATATCCCTT
GCCGTCGCCC TGGCAGGCGG GGGGGGCGCG CGCCTCATCG GCGAACAGAT GCCCCGGGTC
AACGTGCTTA CCGGTTACAC CTGGACCATC ATGATCGTGA CCCTCTTGGG GATACTTCTC
TCCTTCTCCC CTTTGCGCCG CCTGGAACGC TCGGGCGCTT CGCGGACCGG CTACGATCTC
CTATACTTCG TCCTCACCGC CATCGGCGCC AAGGCGTCGG TCGCGGACAC CGGCTCGGCG
CTGGTGCTTA TCGGCGCGGG GATGCTCATC GTGGCGGTGC ACGCCGTCTT CCTGCTCATC
GGAGCGCGCC TTTTGAAGGC GCCGATGTTC CTCGTCGCCG CCGCAAGCCA GGCCAACGTA
GGCGGCGTCG CCTCCGCGCC GGTGGTGGCC GAGGTGTACC ATCCCGGCCT CGCCTCGGTG
GGGCTGCTTC TCGCCATCCT GGGGAACATC GTCGGCACCT GGCTCGGCAT CCTAGCCGCC
CAGCTCTGCC GCCTGCTGTG A
 
Protein sequence
MITNPVLIVA VLISIEALVL WLSRHERTRR FFNLLPSVFW IYFLPMLAAT FGLIATNSPV 
YGLITSWLLP ASLVLLLLPV DIKAILRLGP TAIAMFFIGA AGIIAGAALS FSLFKPVIGA
RFWSGFGAVS ASWTGGSANM IAVKEALSVP DEVFAPMVIV DTVVPYLWMG FMIAIVGVQP
AFDRWNRSNR ATLDHLGEEA VRYLATAGGR RTLSGIAISL AVALAGGGGA RLIGEQMPRV
NVLTGYTWTI MIVTLLGILL SFSPLRRLER SGASRTGYDL LYFVLTAIGA KASVADTGSA
LVLIGAGMLI VAVHAVFLLI GARLLKAPMF LVAAASQANV GGVASAPVVA EVYHPGLASV
GLLLAILGNI VGTWLGILAA QLCRLL