Gene Athe_0789 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_0789 
Symbol 
ID7407976 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp878963 
End bp880624 
Gene Length1662 bp 
Protein Length553 aa 
Translation table11 
GC content38% 
IMG OID643715167 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_002572677 
Protein GI222528795 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.109867 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTCTTA TAGAAATGAC TATTCCAGAC TATTTTGATA TTATAGCTAC AAAGTTTGCT 
GACAATCCCG CTGTCATTTA CCATCATGAA AAGATTTACC TTACATATTC CCAGTTCAAA
AAAATGGTTG ATGATACAGC AAAAGGGTTT ATGGCAATTG GGATTCAAAA AGGAGAACAT
GTAGCTGTAT GGGCAACGAA CAGGCTTGAA TATCTCATTT CTATCTTTGC TTTAGCCAAG
ATTGGAGCAG TGCTTGTTAC TGTAAATACC AACTATAAGA TATATGAGCT TGAGTATCTT
CTCAGACAAT CTGACAGTTC CACTTTAATA TTCACAGAAG GATTTAAAGA TTCAAATTAT
CTTGAGATTG TTAAAAAACT TAATCCTCAG CTTCAGGTGT GCAAAAAAGG AGAACTTGAA
AATCCTAATC TGCCTTATCT TAAAAGACTA ATTTTTATCG GGCAAGACTC TCATGATGGA
ATATACAACT GGCACGAGGT AATTGAACTT GGAGAGAATA TCCCTGATGA GGACCTCATT
CAAAGACAAA AAAGCCTTGA GCCAGATGAA GTAATAAATA TGCAATACAC TTCTGGTACC
ACTGGATTTC CAAAAGGTGT TATGCTTACA CACAAAAATA TTCTCAACAA TGCAAACACT
ATAGCAGATT GTATGAAACT TACACACAAG GACAAGCTGT GCATCCCCGT TCCATTTTTC
CATTGTTTCG GACTTGTTTT GGGTATAGGT GCATGCGTGA CAAAAGGTGC CACCATGGTA
CCGCTTGACC ATTTCAATCC TCTTAAAGTT ATGGAGACAG TCCACTTTGA AAGATGCACT
GGCTTACATG GTGTGCCAAC AATGTTTATT GCAATATTGG AGCATCCTGA ATTTAATAAG
TTTGATTTTT CTTCTCTTCG TACTGGTATA ATGGCAGGAG CACCTTGCCC TATCAAGGTT
ATGAGAGAAG TTGTCGAAAA AATGCACATG AAAGAGATTA CAATAGCATA TGGTCAAACT
GAGGCATCAC CTGTAATAAC TCAGACAAGG GTTGACGACC CTCTTGAGTT TAGGGTGTCT
ACAGTTGGAA AACCACTTGA AGGTGTGGAG GTAAAAATTG TGGATATCCA CACTAAAAAA
GAGGTTCCAA ACGGTGTTAT TGGCGAGATA TGTGCAAGGG GATACAACGT TATGAAAGGG
TATTACAAGA TGCCAGAGGC AACAAAACAA GCCATCGACG AGGATGGTTG GCTTCACACA
GGTGATTTAG GATACATTGA CCAAAATGGA TATTTAAGAA TTACTGGTAG GCTCAAAGAT
ATGATAATAA GAGGTGGAGA AAACATATAT CCACGTGAAA TAGAGGAGTT TTTATATACA
CATCCGGCAG TGAAAGATGT GCAAGTTGTA GGTGTACCAG ATAAAGTCTA TGGTGAGGAG
ATAGCTGCAT TTATAATCCT CAAAGATGGG TGTTATGCAA GCGAGGAAGA GATAAAAGAG
TTTGTAAAAG CAAATCTTTC ACGGCACAAA ACGCCACGAT ACGTTGTGTT TGTTAGCGAG
TTTCCCACAA CTGCAAACGG AAAAGTACAA AAATATAAAC TAAGAGAGAT GGCTATAGAG
ATGTTTGGTC TTCACGATGC GGCAAATATC GAAACAGCTT AA
 
Protein sequence
MSLIEMTIPD YFDIIATKFA DNPAVIYHHE KIYLTYSQFK KMVDDTAKGF MAIGIQKGEH 
VAVWATNRLE YLISIFALAK IGAVLVTVNT NYKIYELEYL LRQSDSSTLI FTEGFKDSNY
LEIVKKLNPQ LQVCKKGELE NPNLPYLKRL IFIGQDSHDG IYNWHEVIEL GENIPDEDLI
QRQKSLEPDE VINMQYTSGT TGFPKGVMLT HKNILNNANT IADCMKLTHK DKLCIPVPFF
HCFGLVLGIG ACVTKGATMV PLDHFNPLKV METVHFERCT GLHGVPTMFI AILEHPEFNK
FDFSSLRTGI MAGAPCPIKV MREVVEKMHM KEITIAYGQT EASPVITQTR VDDPLEFRVS
TVGKPLEGVE VKIVDIHTKK EVPNGVIGEI CARGYNVMKG YYKMPEATKQ AIDEDGWLHT
GDLGYIDQNG YLRITGRLKD MIIRGGENIY PREIEEFLYT HPAVKDVQVV GVPDKVYGEE
IAAFIILKDG CYASEEEIKE FVKANLSRHK TPRYVVFVSE FPTTANGKVQ KYKLREMAIE
MFGLHDAANI ETA