Gene Aazo_0998 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_0998 
Symbol 
ID9338793 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp1055961 
End bp1057373 
Gene Length1413 bp 
Protein Length470 aa 
Translation table11 
GC content39% 
IMG OID 
Productmammalian cell entry related domain-containing protein 
Protein accessionYP_003720493 
Protein GI298490316 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0397137 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGAAGTC TAATTAGCGG CTTCACGTCT ACACGAACTT TTAGAGAAGG CTCAGTGGGA 
TTATTACTTT TACTGGGTTT AGGAGCATTT GGAATAATTC TCCTATGGTT AAATAGAATC
CCCCTTGGAC GCAGTTCTTA TAAAGCTGTG GTGGAATTTG CTAACGCTGG GGGAATGCAA
AAAGGCTCAC CAGTTCGTTA TCGTGGCGTA AAAGTTGGTA GTATTTCCAA TATTAAAACC
GCAGTCAATG CTGTTGCTGT AGAAATTGAA ATCAACGATC CTAACTTGAT AATTCCCGCA
GATTCTAAAA TTCAAGCCAG TCAAACTGGA TTAATTAGCG AAAGTATTAT TGATATTACC
CCAATAACCA ACCTAGCGAC GGGAACTAAT ATTGCTAAAC CCTTAGACAA AGATTGTAAT
CCCAGTCTGA TTATTTGTAA TGAAATCAGT ACATTAAAAG GTCAAATTGG TATCAGCGTT
GATGAACTGA TTCGTCAATC ATCTGATTTT ACGGCTCAAT ATAATAACAA GGAATTTTAT
CAAAACGTTA ATCGCTTGTT AGTAACCTCT GCATCAGCAG CTTCTAGTGT TGCTAACCTC
AGTCGAGAAC TCCAGAGTGT GAGCAAAAGC TTTAAAGGGC AAATCGGCAC ATTTTCTAAT
ACTGCTGTCA CCATCCAGAA AGCTACAAAT GAACTCACTA CAACTACATC TAAAACCGCA
AATCAATTAG GTGAAACAGC CAGCGAGTTT AGTAAAACTG CACAACAAGC AGGTAGTTTG
TTGAATAACT TAGATGAATT ATTGACAACA AACCGTTCGT CCCTAGTTAG GACTTTAAAT
AATATTACTC AAACTAGTAA CCAACTGCGT CAGACAGTTA GTGGTTTATC ACCTGCGGTT
AATCGTTTAA ATGAAGGGGG ATTATTGAAT AATTTAGAAC TTTTGTCTGC GAACGCTGCG
GAAGCTTCAA CTAATTTAAA AGACGCATCC AAGACCTTAA ATAATCCTAA AAATATTGTC
TTACTTCAAC AAACCCTAGA TGCTGCCAGG GTGACATTTG AAAATACCCA AAAAATTACA
TCTGATTTAG ATGAATTAAC AGGCGATCCT CAATTTCGCC AAAATCTCCT GCAGTTGGTG
AATGGTTTGA GTAAATTAGT ATCTTCTACA CAGGATATGC AGGAACAAGC AAAGGTAGCT
GTCACCTTAG ACTCTCTCAA AGCATCTATG AACCAGGCAG AACTCTTAAC TTCTATCCCC
GTCAAAAAAG TTGAAGTAGA AAAACCAGAA TTTATCACCC CCACCCCAAT TGAAAAAGCT
GATGTAATTC AGTTAGATTT AGAAATATCT ACACCACCGG AAACTCCTCT TGGGGAAGCA
GGGGAGCAGG GAGCAGGGAG CAGGGGGAGA TAA
 
Protein sequence
MRSLISGFTS TRTFREGSVG LLLLLGLGAF GIILLWLNRI PLGRSSYKAV VEFANAGGMQ 
KGSPVRYRGV KVGSISNIKT AVNAVAVEIE INDPNLIIPA DSKIQASQTG LISESIIDIT
PITNLATGTN IAKPLDKDCN PSLIICNEIS TLKGQIGISV DELIRQSSDF TAQYNNKEFY
QNVNRLLVTS ASAASSVANL SRELQSVSKS FKGQIGTFSN TAVTIQKATN ELTTTTSKTA
NQLGETASEF SKTAQQAGSL LNNLDELLTT NRSSLVRTLN NITQTSNQLR QTVSGLSPAV
NRLNEGGLLN NLELLSANAA EASTNLKDAS KTLNNPKNIV LLQQTLDAAR VTFENTQKIT
SDLDELTGDP QFRQNLLQLV NGLSKLVSST QDMQEQAKVA VTLDSLKASM NQAELLTSIP
VKKVEVEKPE FITPTPIEKA DVIQLDLEIS TPPETPLGEA GEQGAGSRGR