Gene Ndas_3884 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3884 
Symbol 
ID9247755 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4655270 
End bp4656427 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content73% 
IMG OID 
Productphosphoribosylaminoimidazole carboxylase, ATPase subunit 
Protein accessionYP_003681787 
Protein GI297562813 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.964763 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCGAGC GTAACCGCAC CGTCCCGCGC GTGGGAATGG TGGGAGGGGG CCAACTCTCC 
CGGATGACCC ACCAGGCGGG CATCGCCCTG GGGGTCGACT TCTCGGTCCT GGCGTCCAGC
CCCGCCGACA GCGCCGCGCT GGTGTGCGGT GACGTCTCCC TGGGCGACGA CCGCGGCCTC
GACGACGTCC TGGCCTTCGC CAAGGCCCAC GACGTGGTCA CCTTCGACCA CGAGCACGTG
CCCGAGCCCG TCCTGCGCGC GGTCGAGGAG GCCGGGGGCC TGCTGCGCCC CGGCCGCGAC
GCGCTGCGCT TCGCCCAGGA CAAACTGCGC ATGCGCACCC GTATGGCCGA GCTGGGCGCG
CCCTCGCCGC GCTGGCGTGC CGTCACCACC CTGGAGCACG TCACGGCCTT CGCCGGGGAG
ACCGGGTGGC CGGTCGTGCT CAAGGCGGCC CGCGGCGGCT ACGACGGCAA GGGCGTGTGG
GTCGTCGGTG ACGCCGACGA GGCCCGCGGG GTCGTGGACC GCGCCGCCGC CGAGGAGGTG
CCGCTCCTGG TCGAGGGGAA GGTGGACTTC TCGCGCGAGC TGGCCGTGCA GGTCGCCCGC
TCCCCGCACG GGCAGGTCGC GGTCTACCCG GTCGTGGAGA CCGTGCAGCG CGGCGGCATC
TGCCACGAGG TGATCGCCCC CGCCCCGGAC CTGTCCGAGG ACAAGGCCAC CCACGCCCAG
CAGCTGGCCA TCGAGATCGC CCAGGCGCTG GACGTGACCG GGGTCCTGGC CGTGGAGCTG
TTCGAGACCG CCGACGGCGT GGTCGTCAAC GAGCTGGCCA TGCGCCCGCA CAACTCCGGC
CACTGGAGCA TCGAGGGCGC GCGCACCTCC CAGTTCGAGC AGCACCTGAG GGCCGTGCTG
AACCTGCCGC TGGGCTCGCC GCGCACCAAC GCGCCCTACA CCGTCATGGC CAACCTGCTG
GGCGGCGAGG ACCCCGAGGT CTACCGCCGC TACCTGCACG TGATGGCGAA GGACCCCGAG
GTGAAGGTGC ACTTCTACGG CAAGGACGTG CGTCCGGGCC GCAAGATCGG GCACGTCACC
GTGATGGGTG AGGACTACCG TGACCTGCTG GCGCGCGCGC GAGACGCCGC CGCCTACCTG
CGAGGAGACG AACAGTGA
 
Protein sequence
MSERNRTVPR VGMVGGGQLS RMTHQAGIAL GVDFSVLASS PADSAALVCG DVSLGDDRGL 
DDVLAFAKAH DVVTFDHEHV PEPVLRAVEE AGGLLRPGRD ALRFAQDKLR MRTRMAELGA
PSPRWRAVTT LEHVTAFAGE TGWPVVLKAA RGGYDGKGVW VVGDADEARG VVDRAAAEEV
PLLVEGKVDF SRELAVQVAR SPHGQVAVYP VVETVQRGGI CHEVIAPAPD LSEDKATHAQ
QLAIEIAQAL DVTGVLAVEL FETADGVVVN ELAMRPHNSG HWSIEGARTS QFEQHLRAVL
NLPLGSPRTN APYTVMANLL GGEDPEVYRR YLHVMAKDPE VKVHFYGKDV RPGRKIGHVT
VMGEDYRDLL ARARDAAAYL RGDEQ