Gene Nmul_A1689 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A1689 
Symbol 
ID3784615 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp1929024 
End bp1930148 
Gene Length1125 bp 
Protein Length374 aa 
Translation table11 
GC content51% 
IMG OID637811775 
Productcyanophycinase-like protein 
Protein accessionYP_412379 
Protein GI82702813 
COG category[P] Inorganic ion transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG4242] Cyanophycinase and related exopeptidases 
TIGRFAM ID[TIGR02069] cyanophycinase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAATTAC TTGTGTGGTT TTTTCTCGCT CAGCAGCAAA CAAACTCCAG AATTATATGC 
AGATGGATGG CAACCCCAAT CGTGGTTGCT GTGTTTTCGA CAGGGATAGG CTTGGTAGAG
GCAGCCAATA AATCGCGCTT AGGGATCTCT CTAAATGGCG AGCCGGTCGA TTACCTGCCC
TGCGGCACAA TTCCGTTAAG CACGCGAACT GCTGTATTAA TGGGTGGAGG AGACGACGTC
AAGGAGGCCT TCAGCTGGAT GATTGCCAAG ATGAGCCAAT GTGGCGATGG CAATACAGGG
AGGCCGGGAA ATTTTGTCGT GATTGATAAT GGCAGCGTCC CGCCTGACGA TACTTACATC
AGCATAGTCG GGCCCGTCGC CTCGGTAGTA ACTCTGGTTG TTCCTGACAT AGAAACAGCT
AATGACCCCG CCCTCGAGCC TTACATTCGA AATGCCGGCG CAATCTGGCT AACCGGGGGC
GATCAAGGGC GCTACTACAA TTTCTGGAAG GACTCGTTGC TGGAACAACT GATATCAAAG
CAGGTCCGGA ATTTCAAGAT TCCCATTGGC GGAACAAGCG CGGGAACCAT GGTACTCAGC
GAGTTCGCTT ATGTCGCCGA CCCGTGCGCG ATCACTTCGT CAAAAGCCTT GACCGACCCC
TACTCACAGT GCGTAGCACT GAGACGTGAT TTTTGGAGCG ACAGGACGCC TTTACCCCCC
CTGTTATCGA CTGTCACTGA TTCCCATTTC AATGCGCGCG ATCGCATGGG CCGCCTGATC
ACGTTCCTGG GGCACGCGAT AAATAGCCAA TGGACCAGTG CTGCCATTGC CCAGGCTATC
GGAGTAGATG AGGAAACAGC ATTATTGATG GAAATTGATG ACAATACAGA CCCATCTTCT
CCCGGCACCA ATTTTAGTTA CAAGGTCATT ACAAATACAG GAGTCAGCGG GTCAGTCTAT
ATTCTCAGCA CCGATTCGCA AAGTCAGCTG AACCTTGAGC CCGACCAGCC TCTTAGCTTT
ACCAATGTAA AAGTGAGAAA GATAGAGACT GCGGGAAATG AGAGTGATTA TATTATCGAC
GTCAAGGAAG GCGATTTAAT ATCCAGCACT GGCAGCATTT ACTGA
 
Protein sequence
MELLVWFFLA QQQTNSRIIC RWMATPIVVA VFSTGIGLVE AANKSRLGIS LNGEPVDYLP 
CGTIPLSTRT AVLMGGGDDV KEAFSWMIAK MSQCGDGNTG RPGNFVVIDN GSVPPDDTYI
SIVGPVASVV TLVVPDIETA NDPALEPYIR NAGAIWLTGG DQGRYYNFWK DSLLEQLISK
QVRNFKIPIG GTSAGTMVLS EFAYVADPCA ITSSKALTDP YSQCVALRRD FWSDRTPLPP
LLSTVTDSHF NARDRMGRLI TFLGHAINSQ WTSAAIAQAI GVDEETALLM EIDDNTDPSS
PGTNFSYKVI TNTGVSGSVY ILSTDSQSQL NLEPDQPLSF TNVKVRKIET AGNESDYIID
VKEGDLISST GSIY