Gene Dgeo_1374 details

Gene Information

Locus tag: Dgeo_1374
Symbol:
ID: 4057533
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Deinococcus geothermalis DSM 11300
Kingdom: Bacteria
Replicon accession: NC_008025
Strand:
Start bp: 1454604
End bp: 1456205
Gene Length: 1602 bp
Protein Length: 533 aa
Translation table: 11
GC content: 65%
IMG OID: 641230389
Product: ABC transporter related
Protein accession: YP_604838
Protein GI: 94985474
COG category: [R] General function prediction only
COG ID: [COG3845] ABC-type uncharacterized transport systems, ATPase components
TIGRFAM ID:
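
The start, end, gene length and protein length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and rests on two assumptions that the record does not state explicitly: the coordinates are 1-based and inclusive, and the gene length includes the terminal stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(1454604, 1456205)
print(length, protein_length(length))  # prints: 1602 533
```

Under these assumptions the computed values agree with the Gene Length (1602 bp) and Protein Length (533 aa) fields of this record.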


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.534066
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGGTGG CCAACAAGGA CGTGCTGCAA GCGGTGCGGC ACAACTCCGA ATACGCGCTG 
GAACTGCGGA ACATCACCAA ACGCTTTCCG CTGGTGCTGG CGAACGACAA TATCTCTATG
CAAGTGCGCT GGGGCAGCGT TCACGCTCTG TGCGGTGAAA ACGGCGCCGG CAAAAGCACC
CTGATGAAGA TCGTGTATGG GGCCCAGCCC CCCACCAGCG GCGAGATCGT GGTGGATGGC
CAGCCGGTCC ACTTCACCGA CCCCTCGCAG GCCATCGCTC ACGGCATCGG CATGGTCTTC
CAGCACTTCA TGCTGGTCGA TACCCTGACG GTCACCGAGA ACGTGATCCT GGGAGCCGAG
CCGCGAGCGG GCACCTCCAT CGACTATGCC GGGGCGCGCC GCCGCGTGGC CGAGCTGATC
GAGCAGTTCG GCTTTGATCT CAACCCTGAC GCGCTCGTGG GCGACCTGCC GGTGGGCCTC
CAGCAGAAGG TGGAAATTCT CAAGACGCTT TACCGCGGCG CGCGCATCTT GATTCTGGAC
GAGCCGACTG CCGTCCTCAC ACCGACCGAG ACAGACGAAC TCTTCGACTT TCTGAAAAAT
CAGTACGCGG CAAGTGGCAA CGCGGTCATT TTCATCAGCC ACAAGTTGCA TGAGGTGCTG
CAGATCAGTG ACACCATCAG CGTCATCCGT GACGGCAAAA TGATCGGCAG CATTCCCGCC
CAGGGCGCGA CCACCGAGAC CCTGGCCCGG ATGATGGTGG GCCGCGACGT GAGCCTGAAG
GTGCATAAGG CCCCCGCCCG GCCCGGCGAG GTGGCCCTCG ATGTCCGCAA CGTCACTGTC
AAGGGTGAAC ACGGCAACGC CGTGGATGGT GTCTCCTTCC AGGTCCGTTC GGGCGAAATC
GTCGGGATCG CGGGCGTGGA GGGCAACGGC CAGAGCGAGC TGGTGGAGGC GATCACCGGC
CTGCTGCCGG TTGCCAGCGG CGAGATCACC TATCTGGGCC GTCACGCGCG CGGCGTGCGC
GAGGTGGAAG CGAGCGGCGT CTCGCACATC CCGGAGGACC GCAACGAGCG CGGCCTGGTG
CTGGAGATGA CCACCGCCGA GAACTACATC CTGGGCGAAC ATGACCGCGC TCCCTTCGCT
GGCCCGCTGG GATTCCTGAA TCTGGAGGCC ATCGAGGAAA ATGCCCGCCA GCTCAGTGAG
AAGTACGACG TTCGCCCCCG CAGCGTCAGC CTGCAAGCGG GCCGTTACAG CGGCGGCAAC
GCCCAGAAGC TGATTGTGGC GCGCGAGATG CGCAAGCAGC CCAAAATCCT GATCGCCTCG
CAGCCTACCC GCGGGGTGGA CATCGGCGCC ATCGAGTTCA TCCACGCCCG CATCGTGGAG
GCGCGCGACC AGGGCCTCGC CGTGCTGCTC GTCAGTGCCG ACCTGGGCGA GGTGATGAAC
CTCTCCGACC GCATCCTGGT GATGTACGAG GGCCGGATCG TGGGTGAGGT GGAGGCCGCC
ACCGCCACCG AGACGCAGCT CGGCCTGCTG ATGACCGGCA GCGGGGGCAC GGGCGGGCGC
AGCGGTGCCG TGAGCGACAC CCAGGAATAC GGCACGCGCT GA
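
The gene sequence is printed in space-separated blocks of 10 bases, so it needs cleaning before any computation such as the GC content reported in the Gene Information table. The following is a minimal sketch of that cleaning step and of a GC-fraction calculation. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example uses only the first 20 bases of the sequence rather than the full gene.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two 10-base blocks of the gene sequence, used as a small example.
example = "ATGACGGTGG CCAACAAGGA"
seq = clean_sequence(example)
print(len(seq), round(gc_fraction(seq), 2))  # prints: 20 0.55
```

Passing the full sequence block through `clean_sequence` and then `gc_fraction` gives a value that can be compared with the GC content field of the record.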
 
Protein sequence
MTVANKDVLQ AVRHNSEYAL ELRNITKRFP LVLANDNISM QVRWGSVHAL CGENGAGKST 
LMKIVYGAQP PTSGEIVVDG QPVHFTDPSQ AIAHGIGMVF QHFMLVDTLT VTENVILGAE
PRAGTSIDYA GARRRVAELI EQFGFDLNPD ALVGDLPVGL QQKVEILKTL YRGARILILD
EPTAVLTPTE TDELFDFLKN QYAASGNAVI FISHKLHEVL QISDTISVIR DGKMIGSIPA
QGATTETLAR MMVGRDVSLK VHKAPARPGE VALDVRNVTV KGEHGNAVDG VSFQVRSGEI
VGIAGVEGNG QSELVEAITG LLPVASGEIT YLGRHARGVR EVEASGVSHI PEDRNERGLV
LEMTTAENYI LGEHDRAPFA GPLGFLNLEA IEENARQLSE KYDVRPRSVS LQAGRYSGGN
AQKLIVAREM RKQPKILIAS QPTRGVDIGA IEFIHARIVE ARDQGLAVLL VSADLGEVMN
LSDRILVMYE GRIVGEVEAA TATETQLGLL MTGSGGTGGR SGAVSDTQEY GTR