Gene Arth_1336 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_1336 
Symbol 
ID4446140 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp1498294 
End bp1499472 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content68% 
IMG OID639689144 
ProductnifR3 family TIM-barrel protein 
Protein accessionYP_830830 
Protein GI116669897 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0042] tRNA-dihydrouridine synthase 
TIGRFAM ID[TIGR00737] putative TIM-barrel protein, nifR3 family 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGACTGTTG TAGCGACTCC TCCCGCCCCC AAGCTGGAAC TCCCGCCCCT GAAGCTGGGA 
CCCCTCACGG TGGACACCCC CGTGATCCTG GCCCCCATGG CGGGCATCAC CAACTCCGCC
TTCCGCAGGC TCTGCCGTGA ATACGGCGGC GGCATGTACG TGGCGGAGAT GGTCACCTCG
CGTGCCCTCG TGGAGCGCAC CCCCGAGTCG CTGCGGATCA TCTCCCACGA CGACGACGAA
AAGGTCCGCT CCGTCCAGCT GTACGGCGTG GACCCCGTGA CCGTCGGGCA GGCGGTCCGG
ATGCTTGTCG AGGAGGACCG GGCGGACCAC ATCGACCTCA ACTTTGGCTG CCCCGTTCCC
AAGGTGACCC GGCGCGGCGG CGGATCAGCC CTGCCCTGGA AGATCGACCT GTTTACCTCG
ATCGTCCAGA CGGCCGTCAA AGAGGCGTCC AAGGGCAACG TCCCGCTCAC CATCAAGATG
CGCAAGGGCA TTGACGAGGA CCACCTCACG TACCTCGACG CGGGCCGCAT CGCACGTGAT
GCCGGCGTCG CCGCCGTCGC CCTCCACGGC CGCACCGCGG CGCAGTTCTA TTCCGGCCAG
GCTGACTGGT CCGCCATCGC CCGGCTGCGA GAAGCGCTGC CGGACATTCC GGTCCTGGGC
AACGGCGACA TCTGGTCCGC CGAGGATGCC GTGCGCATGG TCCGAGAGAC CGGCGTGGAC
GGCGTGGTGG TGGGCCGCGG CTGCCAGGGC AGGCCCTGGC TGTTCGGGGA TCTCCAGGCG
GCTTTCGAAG GCAGCGACAC CCGCCACAGG CCGAACCTGC GGCAAGTGGC GGAGGGCGTC
TACCGGCACG CGGAACTGAT GGTGGAAACC TTCGGCGACG AAGGCAAGGC CCTGCGGGAA
ATCCGCAAGC ACATGGCGTG GTACTTCAAG GGATACGTGG TGGGCGGGGA ACTGCGCACC
AGGCTTGCCC TGGTCACCAG CCTTCAGGTG CTGCGCGATA CGCTGGCCGA GCTGGACCAG
GATTCCCCGT ACCCGGGTGC GGACGCCGAA GGCCCCCGCG GCCGCGCCGG TTCGCCCAAG
AGGCCGGCGT TGCCCAAGGA CTGGCTGGAA TCCCGGGCGC TGAACGCCGA ACAGTCCCAG
GACATCTCCG CCGCGGAACT GGACGTGTCA GGTGGCTGA
 
Protein sequence
MTVVATPPAP KLELPPLKLG PLTVDTPVIL APMAGITNSA FRRLCREYGG GMYVAEMVTS 
RALVERTPES LRIISHDDDE KVRSVQLYGV DPVTVGQAVR MLVEEDRADH IDLNFGCPVP
KVTRRGGGSA LPWKIDLFTS IVQTAVKEAS KGNVPLTIKM RKGIDEDHLT YLDAGRIARD
AGVAAVALHG RTAAQFYSGQ ADWSAIARLR EALPDIPVLG NGDIWSAEDA VRMVRETGVD
GVVVGRGCQG RPWLFGDLQA AFEGSDTRHR PNLRQVAEGV YRHAELMVET FGDEGKALRE
IRKHMAWYFK GYVVGGELRT RLALVTSLQV LRDTLAELDQ DSPYPGADAE GPRGRAGSPK
RPALPKDWLE SRALNAEQSQ DISAAELDVS GG