Gene Arth_4024 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_4024 
Symbol 
ID4447825 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp4542540 
End bp4544132 
Gene Length1593 bp 
Protein Length530 aa 
Translation table11 
GC content66% 
IMG OID639691855 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_833499 
Protein GI116672566 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTTCTCAA GCCCCTTTTC GGATGTTGTC ATCCCGGACC AAAGCGTTTA TGAGTACCTC 
TTCGGGGGTC TCACGGAAGC GGATCTGGAC CGGACCGCCG TCGTAGACGG CAGCAGCGGC
GCGGAAACCT CCTACCGCCA GTTGCTGGAA CAGATCGACG CCGTGGCGGG AGCAGTCTCC
GCACAGGGAC TAGGTCCGCA CGGGGTTGCC GCAATCCTCT GCCCCAACGT CCCGGCGTTC
GCTGCCGTCT TCCACGGCCT GCTCCGGGCC GGCGCCACCA TCACCACCAT TAACTCGCTC
TATACCGCCG ATGAAATCAC GCTTCAGCTG CAGGACGCCG CTGCAACGTG GCTGTTCACG
GTGTCCGCCC TGCTTCCGGG TGCCGTGCAG GCGGCCGAGC GTGCCGGGAT CCCGGCGGAC
CGGCTCGTGG TGCTCGACGG CGCCCCGGGT CACCCCTCGC TGAAGGACCT GCTCACCGCC
GGAGCGCCGG TACCTGCCGT TTCCTTCGAC CCGGCCACCC ATGTGGCCGT GCTGCCGTAC
TCCTCCGGTA CCACCGGGCG GCCCAAAGGC GTGAAGCTCA GCCACCGCAA CCTCGTGGCC
AACGTGGAAC AGTCCCGCGG GCTTCTGAAG GTGAAGCCGC AGGACCGGCT TCTTGCCCTG
CTGCCGTTCT TTCACATCTA CGGGCTTACT GTCCTGTTGA ACCTCGCACT GCGGGAACGG
GCCTGCCTGG TCACCATGCC CCGGTTCGAC CTCGCCGAGT TCCTGCGCAC CATCCAGGAC
CACAAATGCA CGTACCTGTT CATCGCGCCG CCGGTGGCCG TGGCGTTGTC CAAACACCCG
CTCGTTGCGG AGTACGATCT CAGCTCCGTC CACACCACGC TGTCCGGTGC CGCGCCGCTC
GACGGGGAAC TCGGCGCCAC GCTCGCCGAA CGCCTCCATT GCCGTGTGCT GCAGGGTTAC
GGGATGACGG AGATGAGTCC TGTGTCGCAC CTGATCCCGG TGGATGCGCC GGACGTTCCG
GTGAGCTCGG TGGGCTTCAC GGTGCCCAAC ATGGAATGCC GGCTGGTGGA CCCTGCCACA
GGCGAGGACA TCGACATCCC GGCGGAGGGA ACCAGTGCCC CGGGCCACCT GCTGTGCCGG
GGACCGAATG TCATGCTTGG ATACCTCAAC CGTCCGGAGG AAACGGCCGA CACCCTGGAC
CCGGACGGTT TCCTGCACAC TGGTGACATC GCGACAGTCC GGGCCGACGG TGTGGTGACC
ATCGTGGACC GGCTGAAGGA ACTCATCAAA TACAAGGGAT ACCAGATCGC ACCGGCCGAA
CTTGAGGCGC TGCTGCTGTC GCACCCGGGC ATCGCCGATG CCGCCGTGAT TGGGACACCC
GACGCCGACG GCCAGGAAGT GCCGATGGCC TTCGTCGTGC GTCAGCCGGG CGCGGAAGGG
GAAGCGCTCG ATGAAGACGG CGTCATCGAC TTCGTGGCCT CCCGGGTGGC ACCCTTCAAG
AAGATCCGCC GGGTGGAGTT CATCGAGGCC GTGCCCAAGT CCGCCTCCGG GAAGATTCTT
CGCAGGATGC TCAAGACGGC CCAGTCGGCC TGA
 
Protein sequence
MFSSPFSDVV IPDQSVYEYL FGGLTEADLD RTAVVDGSSG AETSYRQLLE QIDAVAGAVS 
AQGLGPHGVA AILCPNVPAF AAVFHGLLRA GATITTINSL YTADEITLQL QDAAATWLFT
VSALLPGAVQ AAERAGIPAD RLVVLDGAPG HPSLKDLLTA GAPVPAVSFD PATHVAVLPY
SSGTTGRPKG VKLSHRNLVA NVEQSRGLLK VKPQDRLLAL LPFFHIYGLT VLLNLALRER
ACLVTMPRFD LAEFLRTIQD HKCTYLFIAP PVAVALSKHP LVAEYDLSSV HTTLSGAAPL
DGELGATLAE RLHCRVLQGY GMTEMSPVSH LIPVDAPDVP VSSVGFTVPN MECRLVDPAT
GEDIDIPAEG TSAPGHLLCR GPNVMLGYLN RPEETADTLD PDGFLHTGDI ATVRADGVVT
IVDRLKELIK YKGYQIAPAE LEALLLSHPG IADAAVIGTP DADGQEVPMA FVVRQPGAEG
EALDEDGVID FVASRVAPFK KIRRVEFIEA VPKSASGKIL RRMLKTAQSA