Gene Arth_0051 details


Gene Information

Locus tag: Arth_0051
Symbol:
ID: 4447486
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 56890
End bp: 58290
Gene Length: 1401 bp
Protein Length: 466 aa
Translation table: 11
GC content: 59%
IMG OID: 639687845
Product: inulin fructotransferase (DFA-I-forming)
Protein accession: YP_829552
Protein GI: 116668619
COG category:
COG ID:
TIGRFAM ID:
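
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; neither assumption is stated in the record itself, but both are consistent with the listed values.

```python
# Values copied from the record; variable names are illustrative only.
start_bp = 56890
end_bp = 58290
gene_length_bp = 1401
protein_length_aa = 466

# Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
span = end_bp - start_bp + 1

# A CDS is a whole number of codons. Assumption: the last codon is the stop
# codon, which does not contribute a residue to the protein.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print(span, codons, remainder, expected_protein_length)
```

Under these assumptions the span equals the listed gene length, and 467 codons minus one stop codon gives the listed 466 aa.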


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGTCAAGCA ACAACTACTA CGACGTGACC ACGTGGCCCG TCGGCAATCC GTCCGAGGAC 
GTCGGTGAAG TCATCAACAG CATCATCGCT GACATCAAGG ACCGGCAGAC GGTCACCGAT
GCGAACAATG GAGGAAAGCC GGGCGCGGTG ATCTACATTC CGCCGGGGGA CTACCACCTT
CGTACGCAGG TTTTGATCGA CATCAGCTTC CTCAGGATCC ATGGCTCGGG ACACGGCTTT
ACGTCGTCCA GCATCCGGTT CAATGTTCCG GAAGACGAAT GGCCCGGGCT CCATGAGCTG
TGGCCCGGTG GGAGCCGGAT TATCGTCGAC ATTCCGCCCG GCGGAGACGA AGGTGGAGAC
GGGGAGGAAT CCAAGGGAGC CGCTTTCTAC GTTGAGCGGA GCGGGAGCCC GCGGATCAGC
TCGGTGGAGT TCTCCAACTT CTGCATCGAC GGCTTGCACT TCGACCCGGA TGGCTCGGGG
TCGCATCCGG AAAACACCTA CGTCAACGGC AAGACCGGTA TCTATGTTGC GAACGCCAAT
GACTCTTTCC GCATAACCGG CATGGGGTTT GTCTACCTTG AGAACGCCCT CACCATCTAC
AACGCGGACG CACTTTCCAT TCACGACAAC TTCATCGCTG AATGCGGCAG TTGCATCGAG
CTGCGCGGGT GGGGGCAGGC ATCGAAGATC ACCGACAACC TGGTCGGAGC AGGCTTCAAA
GGTCACTCAA TCTACGCCGA GAACCACGGC GGCCTCCTGG TAACTGCGAA CAACGTCTTC
CCCCGTGGCG CAAGCAGCAT CCATTTCGTA GGCGTCACGC GTTCAAGCGT CACCAATAAC
CGTTTGCATT CGTTCTACCC CGGGATGCTG ATCCTTGCGG AGAACAGTTC GGAAAACCTC
GTGGCCACGA ACCACTTCCT GCGTGACCAT GAACCGTGGA CGCCGTTCCT TGGAGTCGAC
AACGGACTGA ACGACCTCTA CGGACTGCTC TCTGTCAGCG GCAGCAATAA CTCTGTTATC
GGCAACCACT TCTCCGAGAT CATCGATTCA CCCAGCATCC AGCCGGAAGG AGCGACGCCC
GTCATCATCC GGCTGATGGC GGGGGTTGGC AACTTCGTCT CCAACAACCA CGTGGTGGCG
ATGGACGTTC GATCAAAGGC AAGTGACTCC TGCTTCTCGG CCCAGGTGGA CGCTCTGTTG
ACGACCGAGG CTTCGGACGG CCTCGCCGTT ACGGCCGTCA TGGTCGATTC CGAATCGGCC
CGGAATACGA TCCTGGATTC CGGAAGTGAC GCCCAGGTCA TCGCAGACAG GGCCGTTAAC
GCCTTGAGGG CCACGCCCAC CGTCGGTTTC CAGGCAGCCC ACGCACTTGT TGAGCCGCAC
GTAGAATCAG CAACAACATA A
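
The record lists a GC content of 59% for this gene. One way such a figure could be recomputed from the sequence block is sketched below. The function name `gc_fraction` is hypothetical, and the sketch assumes the sequence is pasted in the same 10-base grouped layout used here, so whitespace is stripped before counting.

```python
def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA string.

    Whitespace (the spaces and line breaks of the grouped layout) is ignored.
    """
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# Example on the first 10-base group of the gene sequence only.
first_group = "GTGTCAAGCA"
print(gc_fraction(first_group))
```

Applying the function to the full 1401-base sequence, rather than to the single group shown, is what would be compared against the listed percentage.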
 
Protein sequence
MSSNNYYDVT TWPVGNPSED VGEVINSIIA DIKDRQTVTD ANNGGKPGAV IYIPPGDYHL 
RTQVLIDISF LRIHGSGHGF TSSSIRFNVP EDEWPGLHEL WPGGSRIIVD IPPGGDEGGD
GEESKGAAFY VERSGSPRIS SVEFSNFCID GLHFDPDGSG SHPENTYVNG KTGIYVANAN
DSFRITGMGF VYLENALTIY NADALSIHDN FIAECGSCIE LRGWGQASKI TDNLVGAGFK
GHSIYAENHG GLLVTANNVF PRGASSIHFV GVTRSSVTNN RLHSFYPGML ILAENSSENL
VATNHFLRDH EPWTPFLGVD NGLNDLYGLL SVSGSNNSVI GNHFSEIIDS PSIQPEGATP
VIIRLMAGVG NFVSNNHVVA MDVRSKASDS CFSAQVDALL TTEASDGLAV TAVMVDSESA
RNTILDSGSD AQVIADRAVN ALRATPTVGF QAAHALVEPH VESATT
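
The gene sequence begins with GTG, yet the protein begins with M. This is expected under translation table 11, where GTG can serve as an alternative start codon and the initiating residue is methionine. The sketch below illustrates this with a hand-built codon table; `translate_cds` is a hypothetical helper, and it simplifies by assuming the first codon of the input is always a valid start codon rather than checking it against the table's list of permitted starts.

```python
# Codon table with bases ordered T, C, A, G at each position. Table 11 uses
# the same codon-to-amino-acid assignments as the standard code; it differs
# in which codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Assumption: the first codon is a valid start codon, so it is always
    rendered as M (this is how GTG yields methionine here).
    """
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append("M" if pos == 0 else amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence, in the grouped layout of the record.
print(translate_cds("GTGTCAAGCA ACAACTACTA CGACGTGACC"))
```

The printed residues correspond to the first ten residues of the protein sequence, MSSNNYYDVT.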