Gene Nwi_2552 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNwi_2552 
Symbol 
ID3675267 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrobacter winogradskyi Nb-255 
KingdomBacteria 
Replicon accessionNC_007406 
Strand
Start bp2776124 
End bp2777404 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content66% 
IMG OID637714118 
Productphosphoribosylamine--glycine ligase 
Protein accessionYP_319157 
Protein GI75676736 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0151] Phosphoribosylamine-glycine ligase 
TIGRFAM ID[TIGR00877] phosphoribosylamine--glycine ligase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0177333 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACATTC TTTTGCTTGG TTCCGGCGGC CGCGAACATG CGCTGGCGTG GAAAATCGCC 
GGCTCGCCGC TGGTGACGAA ACTCTGGTGC GCGCCGGGCA ATGCCGGGAT CGCGCGCGAG
GCCGAATGCG TGGCGCTCGA CATCGCGGAT CATGCCGCGG TGATCGGGTT CTGCAAAGCC
AGCAAGGTCG ATCTGGTGGT GGTCGGTCCG GAAGCGCCGC TCGCCGCGGG CATTGTCGAT
GATCTCGCGG CCGCGGGCAT CAAGGCGTTC GGGCCGGGCA AGGCCGCCGC GCAGCTCGAA
AGCTCGAAAG GCTTCACCAA GGACCTGTGC CGCGACAACG GCGTTCCGAC AGCGGAATAT
GAACGCTTCA GGGACGCCGG AGCGGCGAAG GACTATATCC ACGCGCACGG CGCGCCGATC
GTCATCAAGG CCGACGGGCT TGCGGCCGGC AAGGGCGTCG TGGTGGCCAT GACGCTGGAC
GAGGCGCTGG CGGCGGTCGA CATGATGTTC GAGGGCGGCT TCGGTGCCGC GGGCGCTGAG
GTCGTGGTCG AGGAATTTCT TCGCGGCGAG GAAGCGTCGT TCTTCGCGCT ATGCGACGGC
GAACACGCGA TGCCGCTGGC GACCGCGCAG GACCATAAAC GGGCGTTCGA CGGCGACGAG
GGCCCGAACA CCGGCGGCAT GGGCGCCTAT TCGCCCGCGC CCGTGATGAC TGACGAAATG
TGCCGCCGCA CCATGGATGA GATCATCGTT CCGACATTGC GCGCCATGCG TGAGAAGGGC
GCGCCTTTCA AGGGCGTGCT GTTCGCCGGG CTGATGATCA CGGAACAGGG GCCGAAGCTG
ATCGAATACA ACGTCCGCTT CGGCGATCCG GAATGCCAGG TGCTGATGCT GCGGATGAGG
GGCGACATCG TGCCGGCGCT GTTCGCCGCC TGCGACGGGC AGCTGAATCA TTTCAGCATG
CGCTGGTTTC CCGATCCTGC GCTGACCGTG GTGATGGCGG CGAAGGGCTA TCCCGGCAGC
TACGCGAAGG GAACGCGCAT CGACGGAATA GACGAAGCCG CCAAGGCCGG GGGCGTGGAG
ATTTTTCATG CTGGCACGAA AGAGGAGGAC GGGCGCATTC TCGCGAGTGG CGGTCGCGTC
CTGAACGTCT GCGCGCTCGG CAATACCGTC GCGGAGGCGC AGGCCCGCGC GTATGAGGCC
GTCGATCGCA TCACATGGCC CGAGGGCTTC TGCCGTCGTG ACATCGGCCT GCGCGCGGTG
GCGCGGGAGA AGGGCGAATA G
 
Protein sequence
MNILLLGSGG REHALAWKIA GSPLVTKLWC APGNAGIARE AECVALDIAD HAAVIGFCKA 
SKVDLVVVGP EAPLAAGIVD DLAAAGIKAF GPGKAAAQLE SSKGFTKDLC RDNGVPTAEY
ERFRDAGAAK DYIHAHGAPI VIKADGLAAG KGVVVAMTLD EALAAVDMMF EGGFGAAGAE
VVVEEFLRGE EASFFALCDG EHAMPLATAQ DHKRAFDGDE GPNTGGMGAY SPAPVMTDEM
CRRTMDEIIV PTLRAMREKG APFKGVLFAG LMITEQGPKL IEYNVRFGDP ECQVLMLRMR
GDIVPALFAA CDGQLNHFSM RWFPDPALTV VMAAKGYPGS YAKGTRIDGI DEAAKAGGVE
IFHAGTKEED GRILASGGRV LNVCALGNTV AEAQARAYEA VDRITWPEGF CRRDIGLRAV
AREKGE