Gene Arth_3406 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3406 
Symbol 
ID4444136 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp3831303 
End bp3832643 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content69% 
IMG OID639691230 
Productphosphoribosylamine--glycine ligase 
Protein accessionYP_832881 
Protein GI116671948 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0151] Phosphoribosylamine-glycine ligase 
TIGRFAM ID[TIGR00877] phosphoribosylamine--glycine ligase 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCACAAAG TACCCTTGGA GACTGTGAAG GTACTCGTCA TTGGCCCTGG TGGCCGCGAA 
CACGCCATTG TCCGCTCCCT GCTTGCAGAC CCCAACGTTT CCGAAGTCCA TGCGGCTCCG
GGCAACGCGG GTATCGGCAA GCTGGTCCCC ACCTACGCCA TTGACGGCAA TGATCCGGAC
GCCGTAGCGG CCCTGGCCAC CAAGCTGGGT GTGGACCTCG TGGTGGTTGG TCCCGAGGCG
CCCCTGGCCG CCGGGGTTTC CGATGCCGTC CGTGCAGCCG GGATCCCCGT CTTCGGACCC
AGCAAGGCGG CCGCCCAGCT GGAGGCCTCC AAGGCATTCG CCAAGCAGGT CATGGCCGAG
GCCGGCGTTC CCACCGCCAT GGCGCGCGTT GCGAGCACCG CCGAGGAAGC TGCCGACGCG
CTGGACACCT TCGGCGCCCC CTACGTGGTC AAGGACGACG GCCTGGCCGC CGGCAAGGGC
GTGGTGGTTA CCAACAACCG GGACGAAGCC CTGGCCCACG CCCAGAGCTG CTTCGACGCG
GGCGGCTCCG TGGTGATCGA AGAGTTCCTG GACGGTCCCG AGGTTTCCGT GTTCGTCCTG
TGCGACGGCC GGAACACGGT GGCACTCTCC CCGGCGCAGG ACTTCAAGCG CATCTTCGAC
AACGACGAAG GCCCCAACAC CGGCGGCATG GGCGCCTACA CCCCGCTGGA GTGGGCGCCC
GAAGGCCTGG TCCAGGAAGT CATCGACCGC GTGGCGCAGC CCACGGTCAA CGAGATGGCG
CACCGCGGAA CCCCGTTCGT CGGCGTGCTG TTCGTGGGCC TGGCCCTGAC CTCGCGCGGC
ACCCGCGTCA TCGAATTCAA CGTCCGCTTC GGCGATCCGG AAACCCAGGC CGTCCTGGCC
CGGCTCAAGA CGCCGCTCGG TGCGCTGCTG CTGGCAGCTG CCAAGGGCGA ACTGGACAAA
GCGGAAGAGC TGCGCTGGTC CAAGGACACC GCGGTCGCCG TCGTCGTCGC CTCGGAAAAC
TACCCGGACA CCCCGCGAAC GGGTGACCGC ATCCGCGGCC TCAAGAAGGT GGACGAGCTG
GAAGGCGTCC ACGTGATCCA CGCCGGCACC AAGCTGGACG AGGAAGGCAA AGTGGTCTCC
GCCGGCGGCC GCGTGCTCGC CGTGGTCGCG CTGGGAACCG ACCTCGTGGA GGCCCGGGAA
CGCGCGTACG ACGGCGTGGA GCTGGTACAG CTCGAAGGCG GGCAGTTCCG CACCGACATC
GGGCGCAAGG CGGCCCGCGG CGAAATCAAG GTCTCGGCCC CGTCCACCGG AACGCTGCCC
GTAACGAAGG CGAAGGCATA G
 
Protein sequence
MHKVPLETVK VLVIGPGGRE HAIVRSLLAD PNVSEVHAAP GNAGIGKLVP TYAIDGNDPD 
AVAALATKLG VDLVVVGPEA PLAAGVSDAV RAAGIPVFGP SKAAAQLEAS KAFAKQVMAE
AGVPTAMARV ASTAEEAADA LDTFGAPYVV KDDGLAAGKG VVVTNNRDEA LAHAQSCFDA
GGSVVIEEFL DGPEVSVFVL CDGRNTVALS PAQDFKRIFD NDEGPNTGGM GAYTPLEWAP
EGLVQEVIDR VAQPTVNEMA HRGTPFVGVL FVGLALTSRG TRVIEFNVRF GDPETQAVLA
RLKTPLGALL LAAAKGELDK AEELRWSKDT AVAVVVASEN YPDTPRTGDR IRGLKKVDEL
EGVHVIHAGT KLDEEGKVVS AGGRVLAVVA LGTDLVEARE RAYDGVELVQ LEGGQFRTDI
GRKAARGEIK VSAPSTGTLP VTKAKA