Gene Aazo_4977 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_4977 
Symbol 
ID9342784 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp5097681 
End bp5098718 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content46% 
IMG OID 
Productmetalloendopeptidase, glycoprotease family 
Protein accessionYP_003723224 
Protein GI298493047 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000823271 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAACCG TTTTAGCTAT CGAAACCAGT TGTGATGAAA CTGCTGTGGC AATTGTGAAC 
AATCGTCAAG TGTGTAGCAG TATCATAGCC TCGCAAATTC CTGTCCATCA ACAATATGGA
GGGGTAGTCC CGGAGGTAGC ATCACGCCAA CACTTAGAAA CTCTTAATCA ACAGATAGCG
CAAGCTATGG ATGAAAGCTC TATGGGTTGG GAACAAATTG ATGCGATTGC CGCCACTTGT
GCGCCAGGAC TGGTAGGAGC GTTGTTAGTA GGTTTGACAT CTGCCAAAAC TCTAGCGATG
GTTCATAAGA AGCCTTTTTT GGGAGTTCAT CACCTGGAAG GACATATTTA CGCAACTTAC
TTGGCGCAGC CTACTTTATA TCCCCCATTT CTTAGCTTAC TCGTTTCAGG TGGACATACA
AGCTTGATTT ATGTAAAAGA TTGTGGTAAA TACGAAACTC TAGGAGAAAC CCGTGATGAT
GCGGCCGGGG AAGCTTATGA TAAGGTAGCA CGGTTATTAA AGCTTGGTTA TCCGGGTGGA
CCAATCATTG ATAAATTAGC ACAAACAGGC GATACCCACG CATTTGCGCT ACCAGAAGGA
AAAATTTCTC TACCAGGTGG GGGTTATCAT CGCTATGATG CTAGTTTCAG CGGATTAAAG
ACTGCGGTGT TACGGTTAGT GCAGCAATTT GAGAAACATG GTAGAGAACT GCCAATAGCT
GATATTGCGG CCAGTTTTCA GGAAACCATA GCCAAAGCTT TAACCAAAAG AGCGATCACC
TGCGCCCGTG ATTATAAACT AGATACGATC GCCGTAGGTG GTGGCGTAGC AGCCAACACT
GGACTAAGAA AGCACCTACA AGCAGCAGCT GGGGAGCATA ACATCAGAGC CCTCTTCCCC
CCCTTAAAAT ATTGTACAGA CAACGCCGCT ATGATAGGCT GTGCAGCGGC TGATCATCTA
GCCCGTGGAC ATACATCACC TCTAACCTTG GGCGTGAACT CTCGGCTATC CCTAAGTCAA
GTTATGCAAT TGTATTAG
 
Protein sequence
MTTVLAIETS CDETAVAIVN NRQVCSSIIA SQIPVHQQYG GVVPEVASRQ HLETLNQQIA 
QAMDESSMGW EQIDAIAATC APGLVGALLV GLTSAKTLAM VHKKPFLGVH HLEGHIYATY
LAQPTLYPPF LSLLVSGGHT SLIYVKDCGK YETLGETRDD AAGEAYDKVA RLLKLGYPGG
PIIDKLAQTG DTHAFALPEG KISLPGGGYH RYDASFSGLK TAVLRLVQQF EKHGRELPIA
DIAASFQETI AKALTKRAIT CARDYKLDTI AVGGGVAANT GLRKHLQAAA GEHNIRALFP
PLKYCTDNAA MIGCAAADHL ARGHTSPLTL GVNSRLSLSQ VMQLY