Gene Arth_3583 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3583 
Symbol 
ID4443894 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp4022593 
End bp4024344 
Gene Length1752 bp 
Protein Length583 aa 
Translation table11 
GC content67% 
IMG OID639691407 
Productlong-chain-fatty-acid--CoA ligase 
Protein accessionYP_833058 
Protein GI116672125 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAGAAGA ACAGCAATGC CCAGCAGCCC GGGCACCTCT GGAGCGACCG CCCGTGGACC 
AGTTCCTACG GGCCGGGCGT GCCGGCCGAC CTGGTGCTGC CCCAGGGCTC ACTGGTGGAC
CTCATGGACA GCTCCATACG TCGCTACGGG TCGAAGACCG CCCTTGAGTT CTTCGGCGCC
CGCACCAGCT ACCGTGAGCT CGGCGCACTG ATCAGCAAGG CGGCCGCCGG TCTGAAGAAA
CTGGGTGTCA AGGCCGGCGA CAGGGTTGCC CTGGTCATGC CGAACTGCCC GCAGCACATC
GTCGCATTCC ATGCCGTGCT GCGTCTCGGC GCGGTGGTGG TCGAACACAA TCCGCTTTAC
ACGGACCGGG AACTGCGCCA CCAGTTCGAG GACCACGGAG CCGCTGTCGC AGTTGTCTGG
GACAAGGCGG TGGAGCGGGT CCGGCAGTTG CCGGCCGACG TCGGGCTCCG GAGTATCGTC
TCGGTGGAGC TCATCCCGGC CATGCCCCTG GTGCAACGGC TGGCGCTGCG GCTTCCGGTT
CCCGCGGCCC GCAAGGCACG CGGGGCCCTC ACCGTGGGAA AGGACCAGCC GAAGGGCCGG
GCCGCTCCGG CTGCGCGGCC GGTTCTGCCC TGGCGGAAGC TCCTCGAATC CGGAGAGCTC
AAGAAGAAGC ATCCGCGCCC CGCGCCCCAG GACCTCGCCG TCCTCCAGTA CACGTCCGGC
ACCACCGGCT TGCCTAAGGG CGCCATGCTC AGCCACGCCA ACCTGCAGGC AAATGCGGCG
CAGGGCCGCG CCTGGGTGCC GGGGCTCAAG GAGGGTCGGG AAACCGTCTA CGCAGTGCTG
CCAATGTTCC ACGCTTACGG TCTGACGCTC TGCATGACCT TCGCCTTGAG CATCGGCGCG
AAGCTGGTCC TGTTCCCAAA GTTTGATGTG GACCTCGTGT TAAGGGCGCT CAAGAGGTCC
CCGGCGACCT TCCTGCCGGC CGTGCCGCCC ATTTATGACC GGATCGCGGC CGCGGCGGCT
GAACGCGGCA TCGGGCTGGA AAGCATCCGA TACTCCATTT CCGGTGCCAT GAACCTTCCG
ACGTCGACGG TGGAGACCTG GGAGAAGGCG ACAGGCGGCT ACCTGATCGA GGGCTACGGG
CTGACCGAGA CGTCCCCGAT AGCGATCGGC AACCCTTTCG GCCCCAGCCG CAAGCCGGGC
ACCGTCGGGG TGCCGTTCCC GCTGACCGAC ATCCGGGTGG TGGATCCCAG GAATGTTGCG
CGGGACCGCG CCCCAGGCGA GGAGGGGGAA CTCCTGATCC GTGGTCCGCA GGTGTTCTCC
GGCTACTGGA ACCGCCCGGA GGAGACCAAA GAGGCACTCC TCGACGGCGG CTGGTTCCGC
ACCGGCGACA TCGTCTCCGT GGACGACGAC TACTTCGTCA CGATCCGGGA CCGGATCAAG
GAGCTGATCA TCACGGGCGG GTTCAACGTC TCACCCAGCG AGGTGGAGGA CGTCCTTGCC
ACGTTCCCCG GTGTTTCGGA AGTCTCCGTG GTCGGGTTGC AACGCCCGAG TGGCGGCGAG
GACGTGGTCG CCGCAGTAGT GCCCATCCCG GGCACCACCA TTGATCCGGA CGCGCTCCTG
GCCTTTGCCC GGAAGCACCT GACCGCATAC AAGGTGCCGC GCCGGGTGGT GGTGCTTGAT
TCCCTCCCGC GCTCGCTCAT AGGCAAGGTC CTCCGTCGTG AGATCCGGGA CACCCTCGTG
GCCGGGCGGT GA
 
Protein sequence
MKKNSNAQQP GHLWSDRPWT SSYGPGVPAD LVLPQGSLVD LMDSSIRRYG SKTALEFFGA 
RTSYRELGAL ISKAAAGLKK LGVKAGDRVA LVMPNCPQHI VAFHAVLRLG AVVVEHNPLY
TDRELRHQFE DHGAAVAVVW DKAVERVRQL PADVGLRSIV SVELIPAMPL VQRLALRLPV
PAARKARGAL TVGKDQPKGR AAPAARPVLP WRKLLESGEL KKKHPRPAPQ DLAVLQYTSG
TTGLPKGAML SHANLQANAA QGRAWVPGLK EGRETVYAVL PMFHAYGLTL CMTFALSIGA
KLVLFPKFDV DLVLRALKRS PATFLPAVPP IYDRIAAAAA ERGIGLESIR YSISGAMNLP
TSTVETWEKA TGGYLIEGYG LTETSPIAIG NPFGPSRKPG TVGVPFPLTD IRVVDPRNVA
RDRAPGEEGE LLIRGPQVFS GYWNRPEETK EALLDGGWFR TGDIVSVDDD YFVTIRDRIK
ELIITGGFNV SPSEVEDVLA TFPGVSEVSV VGLQRPSGGE DVVAAVVPIP GTTIDPDALL
AFARKHLTAY KVPRRVVVLD SLPRSLIGKV LRREIRDTLV AGR