Gene Nmul_A1900 details


Gene Information

Locus tag: Nmul_A1900
Symbol:
ID: 3784272
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 2187707
End bp: 2188753
Gene Length: 1047 bp
Protein Length: 348 aa
Translation table: 11
GC content: 56%
IMG OID: 637811986
Product: cyanophycinase-like protein
Protein accession: YP_412587
Protein GI: 82703021
COG category: [P] Inorganic ion transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG4242] Cyanophycinase and related exopeptidases
TIGRFAM ID: [TIGR02069] cyanophycinase
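
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length; the function names are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2187707, 2188753)
print(length, protein_length(length))  # prints: 1047 348
```

Under these assumptions the result agrees with the Gene Length and Protein Length values listed in the record.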


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.147675
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATACCG GAGATTGTGC CAATAAATCC TTCCCGCGGC TTTTTGCACA CCTTGCCCTG 
GCCTCTGCGG TGGTCTTCAT GACCTTCGGG GTTCTGGCAG AGCCCAAGCC CAAAGGCTAC
GAATATTACC TTACGGGCAA CGCGGTTGAT GCTGTCCTGC CGCAAAGGCC ACCATCGCCA
TCAACTCTCC TGATGGGCGG AGGTCCCGAT GTGGACGCTG CATTCAAGTG GATGATCCAG
AAGGCGGGAG GCGGCGACTT TGTGGTGATC CGTGTACGGG GAGCCGACGG CTACAACCAG
TATGTCTACG ACATGGGCGG TATAGATTCC ATCGAGACAC TGGTCATAAA GACGCGTGAG
GCCGCCAGCG ATCCGTTCGT GCTCGATCGG ATCAAAAAAG CGGAAATTTT GTTTATCGCG
GGCGGCGACC AGAGTGATTA CATTAATCTC TGGAAAGGAA CTGCGCTCGA AACGGCGATT
AATGAACTGA TTGGTCGCAA TGCACCTATC GGGGGCACCA GCGCGGGACT TGCAGTTCTG
GGTCAGTTTG ACTTCGTAGC GTTGAACGGC ACGGTGTACT CTGACGATGC GCTGGCCGAT
CCTTATAACC GTCGCATGAC TCTCGATCGA GAATTTCTGA CTGCGCCTGG CTTGAATGGG
GTAATTGCCG ATGCGCATCT CGACACGCGC GACCGGATGG GACGCCTTCT CACCTTTCTC
GCCCGTACCA TCCAGGACCA ATGGGTGAGC GTTGAATCCG CCCGAGGCAT TGGTCTGGAT
GTCGAAACCG CGTTGGCGAT TGACAATGGC ATTGCAATCC GCCTGGGTGT CGGCTCGGCA
TATTTTCTAA GGCCCACAAT TGCCCCAACC GTCTGTCAAA GCGGCCAGCC TCTGACTTTC
CGCAATGTCA TGGTAGACAG ACTCTCGGGA TCGGGATCAT TCCATCTCGG CCAGTGGACG
AGTCCGGGAA ATGGCACAAC CCGGTATGAC CTTTCAGCTG AAACCGGGGT GTTGGTCTCG
TCTCAGCTGG GGGGTGGAAT CTATTGA
 
Protein sequence
MDTGDCANKS FPRLFAHLAL ASAVVFMTFG VLAEPKPKGY EYYLTGNAVD AVLPQRPPSP 
STLLMGGGPD VDAAFKWMIQ KAGGGDFVVI RVRGADGYNQ YVYDMGGIDS IETLVIKTRE
AASDPFVLDR IKKAEILFIA GGDQSDYINL WKGTALETAI NELIGRNAPI GGTSAGLAVL
GQFDFVALNG TVYSDDALAD PYNRRMTLDR EFLTAPGLNG VIADAHLDTR DRMGRLLTFL
ARTIQDQWVS VESARGIGLD VETALAIDNG IAIRLGVGSA YFLRPTIAPT VCQSGQPLTF
RNVMVDRLSG SGSFHLGQWT SPGNGTTRYD LSAETGVLVS SQLGGGIY
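
The sequences above are printed in space-separated blocks of ten. As a minimal sketch, the following shows one way to strip that formatting, compute GC content, and translate a fragment of the gene sequence. It assumes the amino-acid assignments of translation table 11 match the standard code (the two tables differ in their permitted start codons, which this sketch ignores), and the helper names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments,
# assumed here to apply to translation table 11 as well).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE.get(seq[pos:pos + 3], "X")  # X = unknown codon
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence shown above.
fragment = clean_sequence("ATGGATACCG GAGATTGTGC CAATAAATCC")
print(translate(fragment))  # prints: MDTGDCANKS
```

The translated fragment matches the first ten residues of the protein sequence. Passing the full cleaned gene sequence to `gc_content` gives a figure that can be compared with the GC content field in the record.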