Gene GM21_2174 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_2174 
Symbol 
ID8137510 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp2542514 
End bp2543746 
Gene Length1233 bp 
Protein Length410 aa 
Translation table11 
GC content61% 
IMG OID644869789 
Productaromatic hydrocarbon degradation membrane protein 
Protein accessionYP_003021984 
Protein GI253700795 
COG category[I] Lipid transport and metabolism 
COG ID[COG2067] Long-chain fatty acid transport protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones125 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATCAAA AGAAAGGCCT GCTGGTCGCG TTGTTGACAC CCATACTTGC TTTCAGCGGC 
GCAGCGGCGG TCCATGCTTC CGGTTACGCA GTATTCACGC ACGGCGCCTC TGCGTTGGGG
CAAGGCAACG CCGTGACAGC CCATAGCAGC GATCCCAGCA CCATTTTCTA CAACCCCGCC
CTTATGAACA AGCTGGAGGG GACCCAGGTC CAGGTAGGGA CCACCGGCGT GTTCTCTTCC
CGCCAGTACG AGCCCGCAGG CGCCGGCAAC GGAACCTCCA GCGACTCAGC CTTCTTTCCG
AGCCACTTCT ATGCGACCCA CAAGCTGAAC AACCAGGTGA GCGTCGGCCT GGGCATCTTC
AACCCCTTCG GTCTGGGGAC CGAGTGGGGC GAGCAGTGGG ACGGGCGCTA TATCGCCACC
AAGTCGACGC TGAAGACATT CAACATCAAC CCTGCCGTTT CGGTTCAGGT CACCCCCAAG
CTGGCTCTCG CCGCCGGGGT CGACGTAGTC CTGCTCGACG CGAGCCTGGA GAAGAAGGTC
CCCTCCGGCG CGCTCGGCCT GACTCCCGAC TTCGATGTGA ACCAGAAGTT CAAAGGTGAC
GGCAAGGGAG TAGGTTTCAA CGTCGGGGTA GCATACGACG TCTGCGACCA CTTCTCTCTT
GGCGCCTCCT ACCGCAGCGA GGTGAAGATT GACGTGTCGG GCGATGCCGC ACTTACCGCT
GGCGGGGCTT CGATTGCGTC CTTCGGGGGG AATACGGACC TGACCCTGCC GCCGCAGTTC
ACCGCAAGCG CGGCCTTCAA AGGGATCGAC AAGCTAGTGC TCGAAGCGGG AGTGCGCTGG
GAGGGATGGT CCAAGTTCAA GGAACTAGCG ATCCATGTCG ACAACGGCCA GCCTGCCGTC
ACCCCTAGAA ACTGGAAGGA CAGTTGGGGG GGGAATGCAG GGGGCAGGTA CCAGATGAAC
GACACCGTCG CGCTTCTGGC CGGCTACGTA TACGGCGACA CCCCCGTTCC CGACAGCACC
TTCGACCCGT CGATACCCGA TGCCAAGACA CATGTTTTCT GCGCCGGCAC CGATTTGAAC
TTCAATCCGG TCACCGTGGC GTTTTCCTAC GGCTACCAGC TGCTGGAAAA CAGGCACAAG
GATAACGGGA TACCGGCCGG AGCCGCCCTG GCAAACGGCA AATACAAGAC CGACGCGCAC
CTGGTAGCGC TCTCCGTGGG ATACAAGTTC TAA
 
Protein sequence
MNQKKGLLVA LLTPILAFSG AAAVHASGYA VFTHGASALG QGNAVTAHSS DPSTIFYNPA 
LMNKLEGTQV QVGTTGVFSS RQYEPAGAGN GTSSDSAFFP SHFYATHKLN NQVSVGLGIF
NPFGLGTEWG EQWDGRYIAT KSTLKTFNIN PAVSVQVTPK LALAAGVDVV LLDASLEKKV
PSGALGLTPD FDVNQKFKGD GKGVGFNVGV AYDVCDHFSL GASYRSEVKI DVSGDAALTA
GGASIASFGG NTDLTLPPQF TASAAFKGID KLVLEAGVRW EGWSKFKELA IHVDNGQPAV
TPRNWKDSWG GNAGGRYQMN DTVALLAGYV YGDTPVPDST FDPSIPDAKT HVFCAGTDLN
FNPVTVAFSY GYQLLENRHK DNGIPAGAAL ANGKYKTDAH LVALSVGYKF