Gene Arth_0405 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_0405 
Symbol 
ID4447100 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp435509 
End bp437077 
Gene Length1569 bp 
Protein Length522 aa 
Translation table11 
GC content65% 
IMG OID639688204 
Productlevansucrase 
Protein accessionYP_829906 
Protein GI116668973 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTGCGGT GGCGCCCCGC CGCCGCAGCA CTGGCGGCGG CCGTGGCGGC TTCCGCCTTC 
CTTGCCGTTC CGTCGGCCCA GGCCAACGAA CCTTCGGACC CGCCCGCCAC CCAGCAGATG
CCGGCACCCA CCCCCGGCTT CCCGCTGCCC ACCGACCACA GCCAGAAGGC CTACGATCCG
GCGGCGGACT TCACCTCAAA GTGGACCCGC GCCGATGCCA AGCAGATCAT GGCCCAGAGC
GACTCCACCG TGGCTCCCGG CCAGAACTCC ATGAGCCCGG ATGTCACCAT GCCGGAAATC
CCTGAGGATT TCCCCGCGAT GAACGACGAC GTCTGGGTCT GGGACACGTG GTCCCTGACC
GACGAAAACG CCAACCAGAT CAGCTACAAG GGCTGGGACG TCATCTTCTC CCTCGTCGCT
GACCGCCACG CCGGCTACGG CTTCGACCAG CGCCACTGGA ACGCCCGGAT CGGCTACTTC
TTCCGCAAGA CCAACGCCGA CCCGGCCAAG GACAAGTGGA ACTACGGCGG ACACGTCTTC
GCTGACGGCG CTTCCATCGG CAACACCGAA TGGTCCGGCT CCACCCGCCT GATGCAGGGC
AACCAGGTCA ATGTGTTCTA CACGGCCACC ACGTTCTACG ACGTTGCCGA ACGCAATGCA
GGCGGCGGCG GCATCGCACC GGACGCGGCC ATCGCCAAGG CGCTGGGCAA GATCCACGCC
GACCAGAACG GTGTCACGTT CGACGGCTTC AAGCACACCA AGCTGCTGGA GCCGGACGGC
AAGATGTACC AGAACAAGGC CCAGAACCCG GGCTTCGCCT TCCGCGACCC GTACACGTTC
GCCGACCCCG CACACCCGGG CAAGACCTTC ATGGTCTTCG AAGGCAACAC CGGCGGCACC
CGCGGCGAAT ACGAGTGCAA GCCCGAGGAC CTTGGCTACA AGGCCGGCGA CCCCAACGCT
GAGAACCTCA ACGAGGTCAA CAGCAGCGGC GCCTACTACC AGACCGCCAA CGTGGGGCTG
GCAGTGGCGG ACAACAAGGA TCTGACCAAG TGGTCCTTCC TGCCGCCGAT CCTTTCGGCC
AACTGCGTCA ACGACCAGAC CGAGCGTCCC CAGATCTTCA TCCAGAATGA AGGCGGCAAG
AACAAGTACT ACCTGTTCAC CATCAGCCAC CAGTTCACCT ACGCGGCCGG CATGCGCGGC
CCCGACGGCG TCTATGGCTT CGTGGGCGAC GGTGTCCGTT CGGACTACCA GCCGATGAAC
AACAGCGGCC TGGCCCTGGG CTCGCCGACG GACCTGAACC TTCCGTCCGA GTCCCCCGAG
GCACCCACCC CGAACCAGAA CGGCCGCCAG TTCCAGGCCT ACTCGCACTA CGTGCAGCCG
GGCGGCCTGG TGCAGTCCTT CATTGACAAC GTGAACGGCG TCCGCGGCGG CTCACTCTCG
CCCACCGTGA AGATCAACTT CCGTGACGGC GTATCCCAGG TGGACCGCAC CTTCGGCAAG
AACGGCCTCG GCCCGTTCGG CTACCTGCCC ACCAACCTCA AGGTTGGCGG CGAGGGCCTC
TACAAGTAA
 
Protein sequence
MLRWRPAAAA LAAAVAASAF LAVPSAQANE PSDPPATQQM PAPTPGFPLP TDHSQKAYDP 
AADFTSKWTR ADAKQIMAQS DSTVAPGQNS MSPDVTMPEI PEDFPAMNDD VWVWDTWSLT
DENANQISYK GWDVIFSLVA DRHAGYGFDQ RHWNARIGYF FRKTNADPAK DKWNYGGHVF
ADGASIGNTE WSGSTRLMQG NQVNVFYTAT TFYDVAERNA GGGGIAPDAA IAKALGKIHA
DQNGVTFDGF KHTKLLEPDG KMYQNKAQNP GFAFRDPYTF ADPAHPGKTF MVFEGNTGGT
RGEYECKPED LGYKAGDPNA ENLNEVNSSG AYYQTANVGL AVADNKDLTK WSFLPPILSA
NCVNDQTERP QIFIQNEGGK NKYYLFTISH QFTYAAGMRG PDGVYGFVGD GVRSDYQPMN
NSGLALGSPT DLNLPSESPE APTPNQNGRQ FQAYSHYVQP GGLVQSFIDN VNGVRGGSLS
PTVKINFRDG VSQVDRTFGK NGLGPFGYLP TNLKVGGEGL YK