Gene Aazo_3119 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_3119 
Symbol 
ID9340922 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp3209323 
End bp3210525 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content39% 
IMG OID 
Productgroup 1 glycosyl transferase 
Protein accessionYP_003721980 
Protein GI298491803 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAATTT GCATTGTTAC CCATAAAATC CGAAAAGGTG ATGGTCAAGG ACGAGTCAAT 
TATGAAGTAG CTATGGAAGC ACTCCGTCGT GGTCATAATT TGACATTATT GTCGAGTGAA
ATAGCACCTG AATTAGAACA TAATACTGCA GTAAATTGGG TTTCCATTTC TGTTGATGGA
TATCCGAGTG AATTTGTTCG CAATTTTGTA TTTGCTAAAA AAAGCGGTGA TTGGTTACGT
AAACATCGGG GCGAAGTTGA TTTAATTAAA GCCAATGGTG CGATTACCAT GGGCGCTACT
GATGTCAATG CTGTGCATTT CGTCCATAGT TCTTGGTGGA AATCGCCTGT ACATATTGCT
CGTCAACGAC GAGATTTATA TGGTTTATAT CAGTGGCTCT ACACTGCTAT TAATGCTTAT
TGGGAAAAAG AAGCTTTTCG CCAAACTAAA GTTGTTATCG CTATATCTAC AAAAGTAGCT
GAAGAATTAG TTAATATTGG CGTACCCCGT GCTAATATTC GTGTGATTGT GAATGGAGTT
GATTTACAAG AATTTTCCCC TGGTGCGAGT TCTCGCCAAA AGTTGGGGAT ACGTGAAAAT
GTGACGTTGG CATTGTTTGC TGGAGATATT CGCATTTCTC GCAAAAACTT AGATACTGTA
CTTCATGCCT TAGTAAAAGT TCCTAGTTTA CATTTAGCAG TTGTTGGTGA AACCAAAGAT
AGCCCCTATC CAGAAATGGT TGCGGACTTA AAATTAACTG AACGGGTACA TTTTTTAGGT
TATCGCCGTG ATATGCCGCA AATTCAACAG GCATCAGATT TATTTGTTTT TCCTTCCCGT
TATGAACCTT TTGGGTTAGT AGTAATTGAA GCGATGGCTT CAGGTTTACC TGTGATTACT
GCTAAAACCA CTGGTGCAGC TGATTTAGTA ACACCAGCTT GTGGAATTGT TTTACCCGAT
TGTGATGATA TTGATACTTT AGCCAATGCT TTGAAATTAT TAAGTAGCGA TCGCACACTA
CGTCAACAAA TGGGTAAAGT CGCTCGTACT ATTGCTGAAC AACATAGCTG GGTAACTATG
GCACAAACCT ATTTAGATTT ATTTGAAGAG TTAATGAAAC ATGAGGAATA CAGTTCTTAT
CCCCACCTAT CGCCGTCCTC AAGACTTGTT ACACTGCCTT TCCGCACTGC AAGCCCAAAC
TAA
 
Protein sequence
MRICIVTHKI RKGDGQGRVN YEVAMEALRR GHNLTLLSSE IAPELEHNTA VNWVSISVDG 
YPSEFVRNFV FAKKSGDWLR KHRGEVDLIK ANGAITMGAT DVNAVHFVHS SWWKSPVHIA
RQRRDLYGLY QWLYTAINAY WEKEAFRQTK VVIAISTKVA EELVNIGVPR ANIRVIVNGV
DLQEFSPGAS SRQKLGIREN VTLALFAGDI RISRKNLDTV LHALVKVPSL HLAVVGETKD
SPYPEMVADL KLTERVHFLG YRRDMPQIQQ ASDLFVFPSR YEPFGLVVIE AMASGLPVIT
AKTTGAADLV TPACGIVLPD CDDIDTLANA LKLLSSDRTL RQQMGKVART IAEQHSWVTM
AQTYLDLFEE LMKHEEYSSY PHLSPSSRLV TLPFRTASPN