Gene GM21_3370 details

Gene Information

Locus tag: GM21_3370
Symbol:
ID: 8138737
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 3902392
End bp: 3903768
Gene Length: 1377 bp
Protein Length: 458 aa
Translation table: 11
GC content: 66%
IMG OID: 644870988
Product: aromatic hydrocarbon degradation membrane protein
Protein accession: YP_003023153
Protein GI: 253701964
COG category: [I] Lipid transport and metabolism
COG ID: [COG2067] Long-chain fatty acid transport protein
TIGRFAM ID:
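
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only illustrative: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are made up for this example.

```python
# Values copied from the Gene Information fields above.
start_bp = 3902392
end_bp = 3903768
reported_gene_length = 1377     # bp
reported_protein_length = 458   # aa

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: one terminal stop codon, not counted in the protein length.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)  # prints: 1377 0 458
```

Under these assumptions the computed values agree with the reported gene length (1377 bp) and protein length (458 aa).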


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 87
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATAAAAT CGGTGTGCTG TGCGGGCGTG GCCTGCGCCT TCTTGGGGGG GGCCGGCATC 
TGCCACGCGG CGGGGTTCAA GGTGAGCGAG CAGGGGGCCA AGGCGATGGC TATGGGAAAC
GCCTTCGCGG CACAGGCGGA CGACCCGAGC GCCCTGTACT TCAACCCCGC CGGCATCTCG
TTTCTGCGCG GCGCGCAGGC GAACCTCGGC TCGTTGGCCA TACTGGTGCC CCAGACCGAG
TTTCACGGCA CCACGCCACT GAGCGGCACC CCTCCCCTGG ACATCGGGAC TGCCCATGTG
ACGGATAAAT CCAGAAGGGA CATCGTGGTA GCCCCGACCC TCTACGCCAC CTACAGCATG
GAGACGCTCC CGCTTTCCTT CGGCCTGGGC GTCAACGCGG TGTACCCACT GACCAAGAGC
TGGGACGACT CCAGCGTCTT CAGGAACCAG GTGCAGACCG CCTCGATCAA ACCGGTCAAC
TTTCAGCCGA CGGTGGCGTA CCGTTTCGAC GACCTGAAGC TGGCGGTAGC CGGGGCTCTC
GATGTCACCT ACGCCGTGGT CTCGCTGCAG AAGACGGCTT ATGCCCCCGC CATAGATCCC
TCCGCGCCGG CTCCCCCCTT CGGCGCCTAC GAGCTCGGAT CGCTGGGGCT GGACGGGACG
GCTACAGGCG TTGGGTACAA CTTCGGGATC CTCTGGAAAC CGCGGCCGCA GTACAGCTTC
GGCGTGGCCT ACCGGAGCCG GATCACCCTC GACGTCAACG GCGACGCCAA TTTCCTCGCC
ACCACCCCCA CCGGCCTTGG GGCCACCGGC CTTGGGGCCA TCGGCCTCTC GGAGGCCTCC
CCCTTCCCCT ACACCAGGGC CCGCGCCGCC AGCGCCGCAT CGACCCGGAT CGTCCTACCA
GACACCCTGG ACGTGGGCAT TGCCTGGCGC CCCACGGAAA AACTTACTTT AGAGTTCGAC
GCCACCCGGA CCGGCTGGAG CAGCTTCGAC CAGTTGCTGA TCGAGTTCGA CTCCCCTGGG
TTCGCGTCCT TCAACAACCG GCCGGACCCC AGGAACTGGC GCGACGTCTG GGCCTACAAG
TTCGGCGGAC AATACTCCCT GAACGACACC CTCGACCTGC GCGCCGGCTA CTCCTTCGAC
AACACCCCCG TTCCCGATGC CACCCTCGAT CCGCTGCTCC CCGACGCCGA CCGCCACAGC
TTCGCCGTCG GCGCCGGCAT TCACCACAGC TTCGGTATCC TCGACCTCGC CTACATGTGG
GTGCACTTCG TGGACCGCAC GGTCGACAAC CAGGACATGG CGGCGCTGCG CGGGTCCAAC
GGCACCTTCA AAAGCGACGC CTACCTGTTG GCCGCGAACC TGAACTTTAA ATTCTGA
 
Protein sequence
MIKSVCCAGV ACAFLGGAGI CHAAGFKVSE QGAKAMAMGN AFAAQADDPS ALYFNPAGIS 
FLRGAQANLG SLAILVPQTE FHGTTPLSGT PPLDIGTAHV TDKSRRDIVV APTLYATYSM
ETLPLSFGLG VNAVYPLTKS WDDSSVFRNQ VQTASIKPVN FQPTVAYRFD DLKLAVAGAL
DVTYAVVSLQ KTAYAPAIDP SAPAPPFGAY ELGSLGLDGT ATGVGYNFGI LWKPRPQYSF
GVAYRSRITL DVNGDANFLA TTPTGLGATG LGAIGLSEAS PFPYTRARAA SAASTRIVLP
DTLDVGIAWR PTEKLTLEFD ATRTGWSSFD QLLIEFDSPG FASFNNRPDP RNWRDVWAYK
FGGQYSLNDT LDLRAGYSFD NTPVPDATLD PLLPDADRHS FAVGAGIHHS FGILDLAYMW
VHFVDRTVDN QDMAALRGSN GTFKSDAYLL AANLNFKF
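
The gene and protein sequences above are printed in space-separated blocks of ten characters. As a rough sketch of how the record's GC content and translation could be checked, the Python below strips the block formatting, computes a GC fraction, and translates codons. The helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical, and the codon table is the standard assignment, which translation table 11 shares for codon-to-amino-acid mapping (the two tables differ in their permitted start codons, which this sketch does not model).

```python
# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)


def clean_sequence(text):
    """Remove the spaces and line breaks used for block formatting."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Return the fraction of G and C bases in seq (0.0 for an empty string)."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate complete codons in frame 1; raises ValueError on non-ACGT bases."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        first, second, third = seq[i:i + 3]
        index = 16 * BASES.index(first) + 4 * BASES.index(second) + BASES.index(third)
        protein.append(AMINO_ACIDS[index])
    return "".join(protein)


# First line of the gene sequence listed above.
first_line = "ATGATAAAAT CGGTGTGCTG TGCGGGCGTG GCCTGCGCCT TCTTGGGGGG GGCCGGCATC"
dna = clean_sequence(first_line)
print(translate(dna))  # prints: MIKSVCCAGVACAFLGGAGI
```

The translated fragment matches the first twenty residues of the protein sequence above. Passing the full gene sequence to `gc_fraction` would give a value to compare against the reported 66% GC content.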