Gene Dgeo_2519 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDgeo_2519 
Symbol 
ID4073750 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDeinococcus geothermalis DSM 11300 
KingdomBacteria 
Replicon accessionNC_008010 
Strand
Start bp529197 
End bp530885 
Gene Length1689 bp 
Protein Length562 aa 
Translation table11 
GC content67% 
IMG OID641228956 
Productlong-chain-fatty-acid--CoA ligase 
Protein accessionYP_594027 
Protein GI94971987 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCCAAG ATTCTTTCTC GGCTCTTCCC CATCCCGCCG CCACGCCGAC TGCTGGGTCC 
GCCTGGCCCG CGGGCAAGCC GCGCCACCTG ACGCTGCCGC GCACCGGCCT GATGCACAAC
CTGCGGGTGA GCGCCGAGCG CTATCCGGAC AAGACCGCCC TGTGGTTTTA TGGCCGTGAG
CTGAGCTACC GCGAGCTGCG CGAGCAGGCC GAGCGTCTGG CTGGGCACCT GGCCGCGCAG
GGCGTGCAAA AGGGTGACCG GGTCGCGGTG TGGCTGCAAA ACAGCCCTGC CTGGGCGGTC
GCGGCCCACG CCGCCTGGCA GCTCGGCGCG GTGGTGGTGC CGCTCGCGCC GATGCTGCAA
GCCCGTGAAT TCGCCTACTT CCTGGGCGAC GCAGGCATCC GGGTTGGCGT GGTGGGGGCC
GAACTGTACG AGCGGGCCAA ACAGGGCGGC CTTGAACACG CGGTTGTCGC CAATATCATG
CGCGGCACCG ACCCCGCAAA GGCAGGGATT CCGCTGCCGA GCGGACTGGA CGTGAACCCC
GAGCTGCAAG CGGGCGATGT AACGCTGGAA GAGGCCCTGA AGGCAGATGC TGCCCCCGCC
GCCGAGATAG GGCCTGATGA CCTGGCGGTG ATGCCCTATA CCAGCGGCAC CACCGGAACG
CCCAAGGGCT GCATGCACAC GCACGGGACC GTGCAAGCGA ATGTGTTCGG CGCGGGCGCC
TGGGTCGACG GCACGGTGGA AGACGTGTTT CTGGCGAGCT TGCCCTTCTT CCACGTAACC
GGTTTCGTCA ACAGCCTGCT CGCGCCCATC AACGGCGGCG GCAAGATTGT GATCATGGCC
CGTTGGGACC GTGATGCAGC ACGTGAACTG ATCCGTGACC AGGGCGTCAC CCTCTGGACC
AATACCGCGA CCATGGTGAT TGACCTGCTG GCCTCCCCGC ATTTCAATCC CTCGGACCTC
CGCAGCCTGC GCAACGTGAC GGGTGGCGGG GCCAGTCTCC CGGCGGCGAT TGGCCAGCAG
CTCCTCGACC AGACCGGCCT CACCTTCTGT GAGGGCTACG GCCTGACGGA GACGATGGCG
CAGACCCACT CCAACCCCAA GAGCCGCCCC AAGCTCCAGT GTCTGGGGAT CCCACTGTTT
GATGTCGATG CCCGGGTGGT GGACCTCGAC ACCGGCGAGG AACTTCCGGT GGGCGGCGTG
GGCGAGATCG TGATTCACGG TCCCCAAGTG ATGAAGGGCT ACTGGAACCG CCCCGAGGCG
ACCGCTGCGG CGTTCATGGA ACTGGACGGC AAACGTTTTT TCCGCACCGG CGACCTGGGC
TACCGCGACG AAGAGGGTTA TTTCTTTTTC ACCGATCGCC TCAAGCGCAT GGTGAACGTC
TCGGGCATGA AGGTGTGGCC CGCCGAGGTC GAAAACACGC TGCACGGGCA CCCCGCCGTG
CAAGAAGCCT GCGTGATCGC GGTGCCCGAT GAGCGCACCG GCGAACGCGC CCGCGCCCTG
ATCGTGCTGA AGCCCGGCCA ACAGGTGACC GGCGAGGAGA TCGAAGCGTG GGCCAGGACG
CAGATGGCGA CCTACAAGGT GCCGCGCGAC TATGTGTTCG TGGAGAGCCT GCCGCGCGGC
GCGACGGGCA AGGTGGCCTG GCGACAGCTC CAGGAACAGG CTCGCGCGGA GCTGGGTGCA
CAGAAGTAG
 
Protein sequence
MTQDSFSALP HPAATPTAGS AWPAGKPRHL TLPRTGLMHN LRVSAERYPD KTALWFYGRE 
LSYRELREQA ERLAGHLAAQ GVQKGDRVAV WLQNSPAWAV AAHAAWQLGA VVVPLAPMLQ
AREFAYFLGD AGIRVGVVGA ELYERAKQGG LEHAVVANIM RGTDPAKAGI PLPSGLDVNP
ELQAGDVTLE EALKADAAPA AEIGPDDLAV MPYTSGTTGT PKGCMHTHGT VQANVFGAGA
WVDGTVEDVF LASLPFFHVT GFVNSLLAPI NGGGKIVIMA RWDRDAAREL IRDQGVTLWT
NTATMVIDLL ASPHFNPSDL RSLRNVTGGG ASLPAAIGQQ LLDQTGLTFC EGYGLTETMA
QTHSNPKSRP KLQCLGIPLF DVDARVVDLD TGEELPVGGV GEIVIHGPQV MKGYWNRPEA
TAAAFMELDG KRFFRTGDLG YRDEEGYFFF TDRLKRMVNV SGMKVWPAEV ENTLHGHPAV
QEACVIAVPD ERTGERARAL IVLKPGQQVT GEEIEAWART QMATYKVPRD YVFVESLPRG
ATGKVAWRQL QEQARAELGA QK