Gene Aazo_2018 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_2018 
Symbol 
ID9339811 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp2092855 
End bp2094327 
Gene Length1473 bp 
Protein Length490 aa 
Translation table11 
GC content44% 
IMG OID 
Productleucyl aminopeptidase 
Protein accessionYP_003721206 
Protein GI298491029 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGATTC AACCTACAAA TACAGCTTTA TTAGATTGGA CTGGTGATAC TTTAGCAGTT 
GGCTTATTTG AAGACGCAGT AGACTTAACA GGGGATTTGG CAATTTTAAA TGATAACTTG
GGCGGCCTGT TAAAAGAACT GATTGCTGAA GAAGAATTTA CAGGTAAAGC TAATAGCACT
ATAGTTGCAC GGGTAGGTGC TGGTCATCCA GTGCGGAAGG TAATTTTAGT TGGTTTAGGT
AAACCTGACG CGCTGAAATT AGAAACTTTA CGCTGTCTTG CGGCTACAGT AGCGCGGACT
GCTAAAAAGC AAAAAACCAA AACTCTAGCC ATCAGTTTAC CTGTTTGGAA CAATGACCAA
GCAGCTACAG GCCAAGCTAT TGCAGAAGGT ACACAACTGG CACTTTACCA AGACATTCGT
TTCAAATCAG AACCGGAAGA TAAAAATCCC CCCATCGAAA CTATTGATTT ACTCGGGTTA
GGGGGACAAG AAGCAGCTAT TACCCGTGCA GAACAAATTG TTTCTGGTGT GATTTTGGCT
AGGCAGTTAG TAGCAGCCCC AGCAAATGCT GTTACACCCA TTACCATGGC AGAAACTGCC
AAAGCCATAG CTAAAGATCA CGGTTTACAT CTGGAAATTC TGGAACAAGA AGAATGTGAA
AAACTAGGCA TGGGTGCATT TTTAGGAGTT GCCCAAGCTT CTGATTTACC ACCTAAGTTT
ATTCACCTGA TTTACAAACC AGCTACCACA CCCAAGCGCA AACTCGCCAT TATTGGTAAA
GGTTTAACCT TTGACTCCGG TGGTTTGAAT ATTAAAGGTG CTGGTAGCGG CATTGAAACC
ATGAAAATTG ACATGGGTGG TGCTGCTGCT ACTCTTGGTG CAGCTAAAGC CATTGGTCAG
CTAAAACCAG ATGTGGAAGT TCACTTTATC TCAGCCGTTA CGGAAAACAT GATTAGCGGT
CGCGCTATGC GCCCAGGAGA TATCCTCACC GCTTCCAATG GTAAAACAAT CGAAGTCAAC
AACACCGATG CTGAAGGCCG TTTAACCTTG GCTGATGCCT TGGTATATGC CGATAAATTG
GGAGTAGATG CGATCGTTGA TTTAGCTACT CTTACAGGTG CTTGTGTAGT TGCCTTGGGT
GACGATATCG CCGGTTTATT TACACCTGAT GATGCTGTAG CTTCCCAACT GCAAACCGCC
TCAGAATCAG CAGGTGAGAA ACTTTGGCGG CTACCAATGG AAGAGAAATA TTTTGAAGGG
CTGAAATCGG GTATTGCTGA CATGAAAAAT ACTGGACCCC GTTATGGTGG TTCTATTACT
GCGGCTTTAT TCCTCAAACA GTTTGTTAAG GATACTCCTT GGGCACACTT AGACATTGCA
GGTCCAGTTT GGGCTGATAA GGAAAATTGC TATAACGGTG CAGGTGCAAC CGGTTTTGGT
GTCAGAACCT TGGTTAATTG GGTACTGAGT TAA
 
Protein sequence
MTIQPTNTAL LDWTGDTLAV GLFEDAVDLT GDLAILNDNL GGLLKELIAE EEFTGKANST 
IVARVGAGHP VRKVILVGLG KPDALKLETL RCLAATVART AKKQKTKTLA ISLPVWNNDQ
AATGQAIAEG TQLALYQDIR FKSEPEDKNP PIETIDLLGL GGQEAAITRA EQIVSGVILA
RQLVAAPANA VTPITMAETA KAIAKDHGLH LEILEQEECE KLGMGAFLGV AQASDLPPKF
IHLIYKPATT PKRKLAIIGK GLTFDSGGLN IKGAGSGIET MKIDMGGAAA TLGAAKAIGQ
LKPDVEVHFI SAVTENMISG RAMRPGDILT ASNGKTIEVN NTDAEGRLTL ADALVYADKL
GVDAIVDLAT LTGACVVALG DDIAGLFTPD DAVASQLQTA SESAGEKLWR LPMEEKYFEG
LKSGIADMKN TGPRYGGSIT AALFLKQFVK DTPWAHLDIA GPVWADKENC YNGAGATGFG
VRTLVNWVLS