Gene Arth_1493 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_1493 
Symbol 
ID4445989 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp1653449 
End bp1657462 
Gene Length4014 bp 
Protein Length1337 aa 
Translation table11 
GC content69% 
IMG OID639689304 
Productnon-ribosomal peptide synthetase 
Protein accessionYP_830987 
Protein GI116670054 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1020] Non-ribosomal peptide synthetase modules and related proteins 
TIGRFAM ID[TIGR01733] amino acid adenylation domain
[TIGR02353] non-ribosomal peptide synthetase terminal domain of unknown function 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.464324 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCCCCG TGACTGAACA ACCGCAAGCC TCCGGATCCG CGCCCGCGCC GGAGTCCGAG 
TCCGGCCCCC TGGAAGGACG CTCTGCTCTT CCCGCCGTCC GGAACGTGCA TCTGCCGCAG
CTGCCCGGCG CCGCCGCGGC ACCGCCGGAG CGGACGCTGA TCGACATCCT GGAGGACACG
GCCCGGAAGT ATCCCGAGGC GTCGGCCCTG GACGACGGCC ACCGGCGCCT GAGTTACGCG
CAGCTGATGG CCGAGGTCCG GGCCACGGCA CGGGAGCTGC ATCTGGCTGG CCTCGGCGCC
GGCGACAAGA TCGGCGTCCG CATCCCTTCC GGCACCAACC GCCTCTATGT CTCGATCCTG
GCCATACTGC TGATCGGGGC TGCCTACGTT CCGGTGGATG CTGATGACCC CGACGAACGC
GCCAAGCTGG TGTTCAGTGA AGCCCGCGTC GGCGCGATCC TCAGAGGCAA TGGCGAGATC
GTCACGGACA GCAAGCGTCC CCGGCCGTTC CCGGCGCCGC GGAAACCGCA AGCCGACGAC
GACTCCTGGG TGATCTTCAC CTCCGGCTCC ACGGGCACCC CCAAGGGCGT CGCCGTCCAG
CACCGCTCCT CGGCGGCGTT CGTTGACGCC GAAGCCCGGC TGTTCCTGCA GGACGAACCC
ATCGGCCCCC AGGACCGGGT GCTCGCCGGA CTGTCCGTGG CCTTCGACGC GTCCTGCGAG
GAGATGTGGC TGGCCTGGCG CTATGGAGCC TGCCTGGTTC CTGCCCCTCG GTCGCTGGTC
CGGACCGGCA TGGACCTCGG CCCGTGGCTG ATCAGCCACG GGATAACAGT GGTGTCCACT
GTGCCCACCC TCGCGGCCCT GTGGCCGGCA GAAGCGCTGG AGAACGTGAG GCTGCTGATA
TTCGGCGGTG AGGCCTGCCC GCCGGAGCTC GCGGATCGCC TCGCCGTCGA CGGCAGGGAG
GTGTGGAACA CCTACGGCCC CACGGAAGCC ACCGTGGTTG CCTGCGCGGC GCAGCTGGGC
GTGCCGGGCC CGCTTCGGAT CGGCCTGCCG CTGGACGGCT GGGACCTTGC CGTGGTTGAC
GCCGGCGGCA TTCCGGTGGA GGAGGGCCAG GTGGGCGAGC TCATCATCGG CGGCGTGGGG
CTGGCCCGGT ACCTCGACCC CGCCAAGGAC GCCGAAAAGT ATGCGCCCAT GCCGTCCCTC
GGCTGGACCC GGGCCTACCG GTCCGGAGAC CTGGTCCGCT ATGAGGCATC GGGGCTCGTC
TTCATGGGCC GCGGCGACGA ACAGGTGAAG CTCGGTGGCC GGCGGATCGA ACTCGGAGAG
ATCGACGCCG CCCTGCAGGC ACTGCCTGAC GTTGCCGGAG CGGCGGCCGC GGTCCGCACG
ACGGCGGCCG GGAACCAGAT CCTCGTCGGT TACCTTGCCG CCGCGGGCAC CGCGGAGATC
GATCTCGCGG CGGCCAGGGA GCTGCTGGGA GCCAGCCTCC CGGCGGCGCT CATACCGCTC
CTCACTGTTG TGGAATCACT GCCCACCAAG ACGAGCGGAA AGGTGGACCG GCACGCCCTG
CCCTGGCCGC TCGCCGGCGC GGGGGCAGCA GATGCGGACA ATGCACCCCT GAACCTGCCG
GACGACGCAC GCTGGATCGT GGAGCAGTGG GGCAGCGTGC TGGGGACACC GGCGTCGAGC
CTGGATGCCG ACTTCTTTGC CCATGGCGGC GGATCCCTGG CAGCCGCGCA GCTCGTGTCC
GCGCTCCGTG TCCGCTATCC CACCATCACG GTGGCCGACA TCTACGCCAC CCCGAGGGTG
GGCGCGCTCA TCGACGTTGC CCGGCAATCC CTGCCCGAGG GGGCGGCCGG ACCGGCGCCG
GAGCGTGAGG TCCGTCCCAC AGCCCTGAAA TCGCAGGTCT TCCAGACTCT GATGGGGGTC
CCCCTGCACA TTCTGGTGGG CATGCGGTGG CTGACCTACG TGATGGCGGC CAACAACCTG
TTGGCCGCCC TTGCCGGCTT CACGGCTGCT CCCGTTGTTT CCTGGTGGTG GGTGGCCGCT
TCCTGGCTGG TGTTCGTGAG CCCGGCCGGC CGGATGGCCC TCTCGGTCCT GGCCGCCAGG
ACCCTGCTCC GGCGCGTGGT CCCCGGGACC TACCCGAGGT CCGGCAAGGT GCACCTGCGG
CTGTGGCTGG CAGAACAAAT CCAGGACTTG TCGGGCGCCA TCAGCCTGGC AAGCGCCCCC
TGGGTTCCGT ACTATGCGAA AGCGCTCGGG GCAAAGATCG GCAACGACGT CCAGCTGCAT
TCGCTGCCGC CCGTCACCGG AATGCTTTCG CTGGGGCGCG GCTGCAACAT CGAACCCGAA
GTGGATCTCT CCGGCTGGTG GATTGACGGC GACACCGTCC ACATCGGCCG GGTGCACGTT
GGCGCCGGCG CCACAGTGGG AGCACGCAGC ACGCTCATGC CAGGCGCCAG TATCGGAGCC
GGCGCCCAAG TGGAGCCGGG CTCGGCGGTT CTCGGCAAGG TCAAGGCCGG GCAGCTGGTG
TCCGGCTCCC CCGCCGAGCG GATCGGCAAA GCCAAACATG CGTGGCCGGA CACGCCGCCG
CCGGCGCATG CGCTGATCGG GCGGCTCTGG TTCGCCGGCT TTGCAGCAGC CTCGGCGGCG
CTGGCCACCA TCCCCTACCT GTCCGCTGCC GCCGGGGCGC TGGTGGTCTT CCTGTTCATC
CGCGGCAGCG AATCACTTTC GGCAGCCATC CCCCAGGTGC TTGCCTCGCT CCCGCTCGCG
GCGCTGACGT GGTTCATCAC CAACCTCCTG CTGATCCTTG CCACCACCCG GCTGCTGGGT
GTGGGACTCA AGGAAGGCTA CTTCAGGGTG CGGAGCCGGA TCGGCTGGCA GGTCTGGGCG
ACAGAACGGG TGCTGGACCT GGCCCGTGAC GTCCTGTTTC CGATCTACGC CAGCCTGTTC
ACCCCGGTCT GGCTGCGCCT CTTGGGAGCC AAGGTCGGCA AGAACGTTGA GGCCTCCACC
GTCCTCCTGC TCCCCAAGAT GACAACCGTC GGCGAAGGCG CGTTCCTTGC GGATGACACC
ATGGTGGCCT CCTATGAACT TGGCGGCGGG TGGATGCGGA TTGCCCCGGC CAAAATCGGC
AAACGCTCTT TCCTGGGCAA CTCCGGAATG ACGGGGGCCG GGCGAAGCGT GCCCAAGAAC
TCCTTGGTGG CCGTCCTCTC CGCCACCCCG GCCAAGGCGA AATCCGGCAC GTCCTGGCTG
GGCAGCCCGC CGGTCCGGCT GCGGCGTACA GCAATTGCCA CGGACAACAC CCTGACCTTC
CAGCCACCCC GGCGGCTGAA ACTGGCCCGC GCCCTGTGGG AGCTCTGCCG GTTCATTCCG
GTGGTGCTCA CGGTGGGCCT GGCCGCCGGC GTCATGCTCG CCTTCGACCG GCTCGCCTCG
CTATGGGGCT ACGGTTTCGC TGCCATCCTG GGAGGCATCG TGGTCCTGGT GGCAGGAGCC
GTGGCCGCGG CCAGCGCCGT TGCGGCCAAA TGGCTGCTGG TGGGCAAAAT CAGGCCGGGC
GAACATGCGT TGTGGAGCTC CTTCATCTGG CGGAACGAGG TAGTGGATAC GTTCATCGAA
ATGGTGAGTG CGCCCTGGTT CGCCCGCTCG GCTTCGGGCA CTCCTGCGCT CGTGTGGTGG
CTGCGCGGGC TGGGAGCGAA GATCGGCCGC GGCACATGGT GCGAAAGTTA CTGGCTGCCC
GAAGCAGACC TTGTCACTCT GGGGGAGAGC TCAACAGTCA ACCGGGGCTG CGTGGTCCAG
ACCCACCTGT TCCACGACCG GATCATGAGC ATCGACACTG TTACCCTCGA AGACGGAGCC
ACGATGGGCC CGCACGGTGT CATCCTTCCG CGGGCACGCA TCGCGAGGGG CGGCACGGTG
GGGCCGGCAT CCTTGGTCAT GCGCGGGGAG ACTGTTCCCG CCGCGACCTA CTGGATGGGC
AACCCGGTCA GCCCCTGGGC AGGTCCGGCC GTACCGGCTC CGCGGCTAAA GTAG
 
Protein sequence
MTPVTEQPQA SGSAPAPESE SGPLEGRSAL PAVRNVHLPQ LPGAAAAPPE RTLIDILEDT 
ARKYPEASAL DDGHRRLSYA QLMAEVRATA RELHLAGLGA GDKIGVRIPS GTNRLYVSIL
AILLIGAAYV PVDADDPDER AKLVFSEARV GAILRGNGEI VTDSKRPRPF PAPRKPQADD
DSWVIFTSGS TGTPKGVAVQ HRSSAAFVDA EARLFLQDEP IGPQDRVLAG LSVAFDASCE
EMWLAWRYGA CLVPAPRSLV RTGMDLGPWL ISHGITVVST VPTLAALWPA EALENVRLLI
FGGEACPPEL ADRLAVDGRE VWNTYGPTEA TVVACAAQLG VPGPLRIGLP LDGWDLAVVD
AGGIPVEEGQ VGELIIGGVG LARYLDPAKD AEKYAPMPSL GWTRAYRSGD LVRYEASGLV
FMGRGDEQVK LGGRRIELGE IDAALQALPD VAGAAAAVRT TAAGNQILVG YLAAAGTAEI
DLAAARELLG ASLPAALIPL LTVVESLPTK TSGKVDRHAL PWPLAGAGAA DADNAPLNLP
DDARWIVEQW GSVLGTPASS LDADFFAHGG GSLAAAQLVS ALRVRYPTIT VADIYATPRV
GALIDVARQS LPEGAAGPAP EREVRPTALK SQVFQTLMGV PLHILVGMRW LTYVMAANNL
LAALAGFTAA PVVSWWWVAA SWLVFVSPAG RMALSVLAAR TLLRRVVPGT YPRSGKVHLR
LWLAEQIQDL SGAISLASAP WVPYYAKALG AKIGNDVQLH SLPPVTGMLS LGRGCNIEPE
VDLSGWWIDG DTVHIGRVHV GAGATVGARS TLMPGASIGA GAQVEPGSAV LGKVKAGQLV
SGSPAERIGK AKHAWPDTPP PAHALIGRLW FAGFAAASAA LATIPYLSAA AGALVVFLFI
RGSESLSAAI PQVLASLPLA ALTWFITNLL LILATTRLLG VGLKEGYFRV RSRIGWQVWA
TERVLDLARD VLFPIYASLF TPVWLRLLGA KVGKNVEAST VLLLPKMTTV GEGAFLADDT
MVASYELGGG WMRIAPAKIG KRSFLGNSGM TGAGRSVPKN SLVAVLSATP AKAKSGTSWL
GSPPVRLRRT AIATDNTLTF QPPRRLKLAR ALWELCRFIP VVLTVGLAAG VMLAFDRLAS
LWGYGFAAIL GGIVVLVAGA VAAASAVAAK WLLVGKIRPG EHALWSSFIW RNEVVDTFIE
MVSAPWFARS ASGTPALVWW LRGLGAKIGR GTWCESYWLP EADLVTLGES STVNRGCVVQ
THLFHDRIMS IDTVTLEDGA TMGPHGVILP RARIARGGTV GPASLVMRGE TVPAATYWMG
NPVSPWAGPA VPAPRLK