Gene Information |
Locus tag | Aazo_4977 |
Symbol | |
ID | 9342784 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | - |
Start bp | 5097681 |
End bp | 5098718 |
Gene Length | 1038 bp |
Protein Length | 345 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | |
Product | metalloendopeptidase, glycoprotease family |
Protein accession | YP_003723224 |
Protein GI | 298493047 |
COG category | |
COG ID | |
TIGRFAM ID | |
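
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in Protein Length; neither convention is stated in the record. The function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 5097681
END_BP = 5098718
GENE_LENGTH_BP = 1038
PROTEIN_LENGTH_AA = 345


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming a single terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # 1038
    print(expected_protein_length(GENE_LENGTH_BP))  # 345
```

Under these assumptions both derived values agree with the Gene Length and Protein Length fields.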
| |
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000823271 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAACCG TTTTAGCTAT CGAAACCAGT TGTGATGAAA CTGCTGTGGC AATTGTGAAC AATCGTCAAG TGTGTAGCAG TATCATAGCC TCGCAAATTC CTGTCCATCA ACAATATGGA GGGGTAGTCC CGGAGGTAGC ATCACGCCAA CACTTAGAAA CTCTTAATCA ACAGATAGCG CAAGCTATGG ATGAAAGCTC TATGGGTTGG GAACAAATTG ATGCGATTGC CGCCACTTGT GCGCCAGGAC TGGTAGGAGC GTTGTTAGTA GGTTTGACAT CTGCCAAAAC TCTAGCGATG GTTCATAAGA AGCCTTTTTT GGGAGTTCAT CACCTGGAAG GACATATTTA CGCAACTTAC TTGGCGCAGC CTACTTTATA TCCCCCATTT CTTAGCTTAC TCGTTTCAGG TGGACATACA AGCTTGATTT ATGTAAAAGA TTGTGGTAAA TACGAAACTC TAGGAGAAAC CCGTGATGAT GCGGCCGGGG AAGCTTATGA TAAGGTAGCA CGGTTATTAA AGCTTGGTTA TCCGGGTGGA CCAATCATTG ATAAATTAGC ACAAACAGGC GATACCCACG CATTTGCGCT ACCAGAAGGA AAAATTTCTC TACCAGGTGG GGGTTATCAT CGCTATGATG CTAGTTTCAG CGGATTAAAG ACTGCGGTGT TACGGTTAGT GCAGCAATTT GAGAAACATG GTAGAGAACT GCCAATAGCT GATATTGCGG CCAGTTTTCA GGAAACCATA GCCAAAGCTT TAACCAAAAG AGCGATCACC TGCGCCCGTG ATTATAAACT AGATACGATC GCCGTAGGTG GTGGCGTAGC AGCCAACACT GGACTAAGAA AGCACCTACA AGCAGCAGCT GGGGAGCATA ACATCAGAGC CCTCTTCCCC CCCTTAAAAT ATTGTACAGA CAACGCCGCT ATGATAGGCT GTGCAGCGGC TGATCATCTA GCCCGTGGAC ATACATCACC TCTAACCTTG GGCGTGAACT CTCGGCTATC CCTAAGTCAA GTTATGCAAT TGTATTAG
|
Protein sequence | MTTVLAIETS CDETAVAIVN NRQVCSSIIA SQIPVHQQYG GVVPEVASRQ HLETLNQQIA QAMDESSMGW EQIDAIAATC APGLVGALLV GLTSAKTLAM VHKKPFLGVH HLEGHIYATY LAQPTLYPPF LSLLVSGGHT SLIYVKDCGK YETLGETRDD AAGEAYDKVA RLLKLGYPGG PIIDKLAQTG DTHAFALPEG KISLPGGGYH RYDASFSGLK TAVLRLVQQF EKHGRELPIA DIAASFQETI AKALTKRAIT CARDYKLDTI AVGGGVAANT GLRKHLQAAA GEHNIRALFP PLKYCTDNAA MIGCAAADHL ARGHTSPLTL GVNSRLSLSQ VMQLY
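
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes that the gene sequence is listed in coding orientation (it begins with ATG and ends with TAG even though the Strand field is "-"), and it uses the amino acid assignments of the standard genetic code, which translation table 11 shares for elongation codons. The `translate` helper and the constant names are hypothetical and not part of the record.

```python
from itertools import product

# Codon table built in TCAG order; amino acid assignments of the standard
# genetic code, which translation table 11 shares for elongation codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base blocks used in the record) is ignored.
    Codons containing characters other than A, C, G, T become 'X'.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three and last two 10-base blocks of the gene sequence above.
FIRST_BLOCKS = "ATGACAACCG TTTTAGCTAT CGAAACCAGT"
LAST_BLOCKS = "GTTATGCAAT TGTATTAG"

if __name__ == "__main__":
    print(translate(FIRST_BLOCKS))  # MTTVLAIETS
    print(translate(LAST_BLOCKS))   # VMQLY
```

The two fragments translate to the first ten and last five residues of the protein sequence. Passing the full gene sequence to the same function would be the natural way to check the whole record.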