Gene Gmet_1901 details

Gene Information

Locus tag: Gmet_1901
Symbol:
ID: 3740305
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter metallireducens GS-15
Kingdom: Bacteria
Replicon accession: NC_007517
Strand:
Start bp: 2121374
End bp: 2122339
Gene Length: 966 bp
Protein Length: 321 aa
Translation table: 11
GC content: 57%
IMG OID: 637779193
Product: twin-arginine translocation pathway signal
Protein accession: YP_384855
Protein GI: 78223108
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
            [TIGR01728] ABC transporter, substrate-binding protein, aliphatic sulfonates family
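
The coordinate and length fields above are related by simple arithmetic, and a short check can make that relationship explicit. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; the function names are hypothetical and not part of the record.

```python
# Values copied from the record above.
START_BP = 2121374
END_BP = 2122339


def gene_length_bp(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive start/end coordinates."""
    return end - start + 1


def protein_length_aa(gene_len: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


gene_len = gene_length_bp(START_BP, END_BP)
print(gene_len)                     # 966
print(protein_length_aa(gene_len))  # 321
```

Both results agree with the Gene Length and Protein Length fields of the record.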


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.00273857
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.00000000000094829
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAACAGAC GCGACTTTCT CAAAACCTCG GGCCTGGCTT TGGCTGCTGC TGCCACTCCC 
TTCGGCCGTT TCGCCTGGGC CGGCGGGGCC ACACTTTCCC TTAAGATCGG TTATCTCCCC
CTCACCGATC ACCTCCTCAT GATTGCCGCC GAGAGGGAGC AGTTCAAAAA CATCCGCATC
AAGCCGGTGA GGTTTTCCTC ATGGCCCGAG ATCGCCGAAG CCCTGAAGAA TGGGAAAATC
GACGGCGGTT TCCTCCTCAC TCCCATGGGG TTGGCGCTCC GCCAAAAGGG GGCACCGATC
AAAGTGGTGC TCCTGGGGCA CCGTAATGGC AGCGCCATAA CGGTAAAAAA TTCACCCGAT
ATCAACCGTA TCACGGATTT GCGGGGCAAG ACCGTCGCCA TACCGAGTCC TTTTTCGACT
CACAACCTGC TGCTGCGCAA GGCCTTGAGC GAGAAGGGGC TGGTGCCGGG GCGGGACGTG
AAAGTGATAG ATATGGCGCC TCCCGAGATG CCGGTTGCAC TCGCCACCGG CCGGATCCAC
GGCTTCATGG TGGCCGAGCC TTTCGGTGCC CAGGCCGAGG CCCAGAAGGT GGGAAAGATC
CTCGTTCTCT CCAAGGATAT CTGGAAAGAC CACATCTGCT GCACCCTTAA CCTGCAGGAA
AAGATAATCC AGAGCTATCC TGCCGAGGTA CAGGAACTGG TTACGGGGCT TATCCGTACC
GCTTCTTTCA TTGAGTCCAA GCCGGCCGAA GCGGCCAGGG GATCGGTAAA GATTCTCGGC
CAGCGCCCGG AGATCGTAGA GAAAGTCCTC ACTACTCCTC AGGATCGCCT CACTTTCCGA
AATTTGGCTC CTTCACGGGC AGATTTCGTC GCCATTCAGG ACTACATGGT GAAGTTTGGG
GTTGCGAAAG CCAAGGTGGA TCTGACTGGC TATCTGGATG ACCGCTTTGC GAAGAGGAAA
GGGTAG
 
Protein sequence
MNRRDFLKTS GLALAAAATP FGRFAWAGGA TLSLKIGYLP LTDHLLMIAA EREQFKNIRI 
KPVRFSSWPE IAEALKNGKI DGGFLLTPMG LALRQKGAPI KVVLLGHRNG SAITVKNSPD
INRITDLRGK TVAIPSPFST HNLLLRKALS EKGLVPGRDV KVIDMAPPEM PVALATGRIH
GFMVAEPFGA QAEAQKVGKI LVLSKDIWKD HICCTLNLQE KIIQSYPAEV QELVTGLIRT
ASFIESKPAE AARGSVKILG QRPEIVEKVL TTPQDRLTFR NLAPSRADFV AIQDYMVKFG
VAKAKVDLTG YLDDRFAKRK G
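
The protein sequence can be cross-checked against the gene sequence by translating it with the listed translation table 11. The following is a minimal sketch, not the database's own method: it builds the codon table by hand (the amino acid assignments of table 11 match the standard code; alternative start codons are not modeled, which does not matter here because the gene starts with ATG), and translates only the first and last rows of the gene sequence above. The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (NCBI translation table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Map each codon (e.g. "ATG") to its one-letter amino acid; "*" marks a stop.
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, ignoring whitespace between blocks."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First and last rows of the gene sequence, copied from the record.
first_row = "ATGAACAGAC GCGACTTTCT CAAAACCTCG GGCCTGGCTT TGGCTGCTGC TGCCACTCCC"
last_row = "GGGTAG"

print(translate(first_row))  # MNRRDFLKTSGLALAAAATP
print(translate(last_row))   # G*
```

The first row yields the first 20 residues of the protein sequence, and the last row yields the final glycine followed by the TAG stop codon, which is not counted in the 321 aa protein length.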