Gene Arth_1547 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_1547 
Symbol 
ID4445948 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp1723664 
End bp1725523 
Gene Length1860 bp 
Protein Length619 aa 
Translation table11 
GC content65% 
IMG OID639689362 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_831041 
Protein GI116670108 
COG category[I] Lipid transport and metabolism 
COG ID[COG1022] Long-chain acyl-CoA synthetases (AMP-forming) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.174488 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATAACA AAGGAGCTAT CGTGCGCGAA TTCAGTGTTC CGCCCTTGGT GAATGTACCC 
CCGGAAACCA ACATCACCGA CCTGGTGTTG CGGCAGGCCG CCAAGGCCTC GAACCCGTCC
CTGTTTTCGC GGCTCGATGC CGCCGGACAG TGGCAGGATA TTTCTGCCAC GGATTTCCTC
GCCGACGTGC GCATCCTGGC CAAGGGCCTC ATGGCCAGCG GAGTGGCAGC AGGCGATCGC
GTCGGCATCA TGTCCCGCAC CCGCTACGAG TGGGCGCTGG TCGACTTTGC GATCTGGTTT
GCCGGTGGCA TCTCCGTTCC CATCTACGAA ACCTCCTCCC CCAGCCAGGT CGCCTGGAAC
CTGGGCGACT CGGGTGCCGT CGCGGCCTTC GGCGAGTCGG CGCACCACGA GGACATCATC
CGGCAGGCCG CAACCTCGGA AGGGCTTTCA TCCCTGGCCC ACGTCTGGCA GCTTGAGGGC
GCCGGGCTGG ACGAGCTCCG CGCGGCCGGC ACCACCGTCA GCGACGAGGA GCTCGAAGCC
CGCAGGAGCC TGGCTTCGCT GGCCGACGTC GCGACGATCA TCTACACCTC CGGCACCACC
GGACGGCCCA AGGGCTGCGA GCTGACGCAC GGGAACTTCG TGGAGTTGTC CGAGAATGCA
CTGGCCACCT CGCTCTCAGG CATCGTCCAC GAGCAGGCAC GAACCATCAT GTTCCTGCCA
CTCGCACACG TTTTCGCCCG GTTCATCTCG GTCCTGGCCG TGGCTGCCGG CGTCACTGTG
GCGCACACCC CGGACATCAA GCACCTCCTG CCGGACCTGC AAAGCTACAA GCCCACGTTC
ATCCTCGCCG TCCCGCGCGT ATTCGAAAAG GTCTATAACT CCGCGCTGAC CAAGGCCGAG
GACAGCGGCA AGGGCGCCAT CTTCCACAAG GCAGCCGACA CCGCCATCGC CTACTCGCGG
GCCCGGCAGG CCGGTTCCAT CGGCTTCGGC CTCAAACTCC GCCACGCGCT GTTCGACAAG
CTTGTCTATA GCAAGCTCCG CGCGGCCATG GGCGGCCAGG TGGCACACGC AGTGTCCGGC
GGCGGTCCGC TGGGTGAACG CCTGGGGCAC TTCTTCCAGG GCATCGGCAT GCAGATCCTT
GAAGGCTACG GCCTGACCGA AACCACCGCG CCGATCACGG TCAACACTCC CTCGCTCATC
AGGATCGGGA CGGTGGGCGC TCCCCTGCCG GGGAATGCGG TGAAAATAGC CGACGACGGC
GAGATCCTCG CCAAGGGCGT CTGCGTGATG CGCGGCTACT ACAAGCGCGA CGACCTCGCA
GCCGACACGT TCGTGGACGG CTGGTTCCGC ACGGGCGACA TCGGACAAAT GGACGCCGAC
GGCTTCCTGA CCATCACAGG CCGCAAGAAG GAAATCATCG TGACGGCCAG CGGCAAGAAC
GTGGTGCCTG CCCTGCTGGA AGACCAGATC CGGGCCGACG CCCTCGTCTC CCAGGTGCTG
GTTGTGGGCG ACAACATGCC GTTCATCGGA GCCTTGGTGA CACTCGATGA GGAAGCCCTG
CCGGGATGGC TGCAGCGTCA CGGACTTCCG GCCGGCACCA CGGTCGCGGA AGCGGCAGGC
CATCCGGTGG TCAAGGCTGC CGTCCAGGAC CTCATCACCC GCGCCAACCA GTCAGTGTCC
CAGGCGGAAG CCATTAAATC GTTCCGGATC GTACCGTCTG ATTTCACCGA GGCATCCGGC
CATCTCACCC CCTCCATGAA GGTCAAGCGG GCCCAGGTGA TGAAGGACTT CGACGCCGTC
ATCGCGGACA TGTACGCTAC ACCGCGGCCG GCCCGTACGG AGCCGTCCGG ACAGCACTAG
 
Protein sequence
MDNKGAIVRE FSVPPLVNVP PETNITDLVL RQAAKASNPS LFSRLDAAGQ WQDISATDFL 
ADVRILAKGL MASGVAAGDR VGIMSRTRYE WALVDFAIWF AGGISVPIYE TSSPSQVAWN
LGDSGAVAAF GESAHHEDII RQAATSEGLS SLAHVWQLEG AGLDELRAAG TTVSDEELEA
RRSLASLADV ATIIYTSGTT GRPKGCELTH GNFVELSENA LATSLSGIVH EQARTIMFLP
LAHVFARFIS VLAVAAGVTV AHTPDIKHLL PDLQSYKPTF ILAVPRVFEK VYNSALTKAE
DSGKGAIFHK AADTAIAYSR ARQAGSIGFG LKLRHALFDK LVYSKLRAAM GGQVAHAVSG
GGPLGERLGH FFQGIGMQIL EGYGLTETTA PITVNTPSLI RIGTVGAPLP GNAVKIADDG
EILAKGVCVM RGYYKRDDLA ADTFVDGWFR TGDIGQMDAD GFLTITGRKK EIIVTASGKN
VVPALLEDQI RADALVSQVL VVGDNMPFIG ALVTLDEEAL PGWLQRHGLP AGTTVAEAAG
HPVVKAAVQD LITRANQSVS QAEAIKSFRI VPSDFTEASG HLTPSMKVKR AQVMKDFDAV
IADMYATPRP ARTEPSGQH