Gene Arth_1080 details

Gene Information

Locus tag: Arth_1080
Symbol:
ID: 4446418
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 1166725
End bp: 1167984
Gene Length: 1260 bp
Protein Length: 419 aa
Translation table: 11
GC content: 68%
IMG OID: 639688886
Product: Formyl-CoA transferase
Protein accession: YP_830574
Protein GI: 116669641
COG category: [C] Energy production and conversion
COG ID: [COG1804] Predicted acyl-CoA transferases/carnitine dehydratase
TIGRFAM ID:
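
The length fields in this table can be cross-checked against the coordinates. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. Neither assumption is stated in the record, although both are consistent with its values. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 1166725
end_bp = 1167984

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, "bp")     # compare with "Gene Length"
print(protein_length_aa, "aa")  # compare with "Protein Length"
```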


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.982565
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCACCG TGACCCTGGA AACGGCAGAG CTGGAACGCA CGACGGCGGC GCCCGCCGCC 
GCGGGGGAAC CCACCGCCAC CCCGCTGCCG CTGGACGGCA TCAAGATCGT GGACTTCACC
CAGGTGTTCA TGGGCCCGTC CTGCACGCAG ATGCTGGGCG ACTATGGCGC GGACATCATC
AAGGTGGAAC GCCCCGGCGC CGGGGACATC TCACGCAACT CGTTCCCGGA CAAGGACGGC
CAGGACAACC CGATCTTCCT GTCCATCAAC CGGAACAAGC GCAGCGTCTC CATCGACACG
CGCACCGAGG AAGGCCGGAA CGTGCTGCAC GCCATCATGG CGGACGCCGA CGTGGTGGTC
AGCAACTTCC GCTCCGGTGT GATGGAGCGG ATGGGCTTCG GCTACGAGGA ACTCAAGGCC
GAGAACCCCG GCATCATCTG GGCCTCGGGC ACCGGCTTCG GCCCCGTGGG CCCGTACTCG
CACAAGGGCG GCCAGGACGC GATCGCGCAG GCCTACTCCG GTGTGATGTG GCGGCGGGAA
TCGGACGACC AGAAGCCGTC CATCTACCCC ACCACCCTCT GCGACTACAT CACCGGCATG
CACCTCATGC AGGGCATCCT GCTGGCACTG CGCACCCGGG AAACCTCCGG CGTCGGCCAG
AAGGTGGAGG TGACCATGTA CGACTCCATG CTGCACCTGC AGATGCAGGA GGCGTGCATG
CAGCTCAACC GCGGCTACGA GGTCAACTGG GGCGCCATGC CGCTCAGCGG AGTGTTCGAG
ACCACCGACG GCGCCGTCTG CATGGTGGGC GGTTTCACTC CGGACCCGCT GGCCCGCATC
TCCGAAGCCC TCGGGCTGGA CGAGGACCTT ACGCAGCGGC CCGAGTTCGC CAACCTGGAG
CAGCAGTTCG CGCACAAGCC GGCGCTGCAG GCCATCTTCC GCGAGCGCAT CGCCACCAAC
ACCACCGAGT ACTGGACCGG CAAGCTGGAA GACCAGGGGC TGCTCAACGC CCCGGTCCAC
ACCCTGGAGC AGGCCCTGGC CGATGCCCAG ACCGAGGCCA ACGGCATGAT CGTCGAGGCC
GAACACCCCG GCGTCGGGAC CGTGCGCATG CTCAACGCGC CCATCCGGCT CTCCGCCACG
CCTCCCACCG TCCGGCGCGC GGCGCCCCGG CTGGGCGAGC ACAACGTGGA GGTCCTGCTG
GAGAACGGGT TCGATGAGGA GACCATCGCG CGGCTGCAGC AGCTGGGGGT GCTCCGGTGA
 
Protein sequence
MSTVTLETAE LERTTAAPAA AGEPTATPLP LDGIKIVDFT QVFMGPSCTQ MLGDYGADII 
KVERPGAGDI SRNSFPDKDG QDNPIFLSIN RNKRSVSIDT RTEEGRNVLH AIMADADVVV
SNFRSGVMER MGFGYEELKA ENPGIIWASG TGFGPVGPYS HKGGQDAIAQ AYSGVMWRRE
SDDQKPSIYP TTLCDYITGM HLMQGILLAL RTRETSGVGQ KVEVTMYDSM LHLQMQEACM
QLNRGYEVNW GAMPLSGVFE TTDGAVCMVG GFTPDPLARI SEALGLDEDL TQRPEFANLE
QQFAHKPALQ AIFRERIATN TTEYWTGKLE DQGLLNAPVH TLEQALADAQ TEANGMIVEA
EHPGVGTVRM LNAPIRLSAT PPTVRRAAPR LGEHNVEVLL ENGFDEETIA RLQQLGVLR
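
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is illustrative and is not the method used to produce this record. It builds the codon table from the standard NCBI amino-acid string, whose assignments translation table 11 shares (table 11 differs from the standard table only in its permitted start codons, which this sketch does not handle). The helper names `translate` and `gc_percent` are hypothetical. Only the first 30 and last 12 bases of the gene sequence are used here; the full sequence can be pasted in the same way.

```python
# Codon table in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(dna: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1; '*' marks a stop codon. Trailing bases are ignored."""
    seq = clean(dna)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(dna)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First three 10-base blocks and last 12 bases of the gene sequence above.
gene_start = "ATGAGCACCG TGACCCTGGA AACGGCAGAG"
gene_end = "GTGCTCCGGTGA"

print(translate(gene_start))  # compare with the start of the protein sequence
print(translate(gene_end))    # the final '*' is the TGA stop codon
```

Running `gc_percent` on the full gene sequence gives a value to compare with the GC content field in the Gene Information table.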