Gene Aazo_4658 details


Gene Information

Locus tag: Aazo_4658
Symbol:
ID: 9342465
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: 'Nostoc azollae' 0708
Kingdom: Bacteria
Replicon accession: NC_014248
Strand:
Start bp: 4764469
End bp: 4766103
Gene Length: 1635 bp
Protein Length: 544 aa
Translation table: 11
GC content: 35%
IMG OID:
Product: family 39 glycosyl transferase
Protein accession: YP_003722998
Protein GI: 298492821
COG category:
COG ID:
TIGRFAM ID:
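
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the gene length counts the terminal stop codon; both assumptions agree with the values listed here, but the record does not state them. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int, includes_stop: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length.

    Assumes the CDS length is a multiple of three and, by default,
    that the last codon is a stop codon that is not translated.
    """
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    return codons - 1 if includes_stop else codons


length = gene_length_bp(4764469, 4766103)
print(length, "bp")                      # gene length from the coordinates
print(protein_length_aa(length), "aa")   # protein length implied by that gene length
```

Under these assumptions the coordinates give 1635 bp and the coding length gives 544 aa, matching the Gene Length and Protein Length fields.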


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.538704
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTGTACA AATTACCTTT ATTTGTCGCT ATTAAACAGC GAATTCTCCT ACTGAAGCAA 
TTTCCCCACA TCAGTTTGTT AATTTGGCTA ATTCCTTTAT TGTTATTTAC TTCTGGTGAA
ACTAGTCTGA TGGCCCATGA TGAAACTCTT TATGCGAGGA GAGCGCGATT AATCTTTGAT
TCTGGTGACT GGATAGCACC TTGGAAAACA GCCCATCATA AAACCCCTGG TTTTTATTGG
TTAATTGCTA TTTTTTATCA GTTATTTGGC GTTAGTGATA CTAGTGCGCG AATACCTAGT
ATGATTGCCG GAATTTTGAG CATATTAGTT ATATATGAAA TAGCGAAGAT ATTGCTATAT
CAAAAATTAG CTTACCTATC GGCAGCTATT TTGAGTGTGG AATTTCTCTG GCTGCAATAT
TGTCGCCTAA CTGCACCTGA TGTACCAATG ATTTTATTGG TATGTTTAGC TATTTTGTGT
TTATTAAAAG CAGAAATATA TCCTAAATAT CGAGTTATTT ATGGTTTTAT AGTCGGTGTC
AGTTTTGGGT TAGGCTTTTT GATGAGAAGC TTTATGATTT GTTTGCCAAT AGCGGCATTA
TTACCTTATT TAATTTGGGA ACATCGTCGG CATCGTCATC TTACTAACCC AATGCTCTAT
TTAGGAGGTT TAGTCGGTTT AATTCCTACT TTAGTTTGGT TATGGTTTAA CTGGCAGCGT
TATGGTGGTG ATAGTGTTGG TCATTTGTTG GGATTTGTGG CAGAATTAGG TAGCAGTAAA
CGTCCGGGTA ATGGCATACT TTACTATTTG TGGAATGTGC CGTTAAAATC TTTTCCTTGG
TGTTTATTCA GCATTTTAGG GTTAGTTTTA GTAATTCGTA AACCCATCTC TCGGTACCAA
CTACTATTAG TAGGTTTTCC AATATTTCTA TTTACAGAAA TCACTATATT TTCCACTCGG
TTAATGCACT ATAGTCTTTG TTTATATCCC TTTATCGCTA TGTTGGCGGC TGTGGGGTTA
GATTGGTTAG CAAAAATTTA CCAAGAAGGA AAAGCGAATA AAATTAATGC CCATCTTCCT
AGAAATATCA GCTATTCTTT TGGTGTTCTA GGCATTATCT TAATAATAGC TGGTATAGTG
GTTTTAGTTG CAAATTTGGT TGATATTAAA AAATATGCCA TTGTGGCTTT AGTCACAGGG
TTAGGTTGCT TAATTGTGCC TTTAATTTGG ATTTTACGTT ACTCTCTTCA TCAAAATTTT
CTTACGGCTC CTTACTGGGT GAGTGGTTGG TTAATAACAA GTTGGCTATC TATAGCTACG
GCTGGTAGTT TAGGTTTGTT AGGAGATTTT AATCCTGCAT TTAGAATATT TTTTCACCAA
CAATCTATTG TGAAAATTCT CCAAAATCAT CCGGTCTCTT TTGTGAATTT AGAGGGTAAA
AATGCTGTAT TGATTAATTT TTATACTCCT ATTCATGGTC AGGAAGTAAA GTCTGTTTCT
CAATTACCAC CTTTGAGTTA TGCTTGGATT TATACGCCTA ATTCTGGTAA TTTAGTTAAA
AGTTATCGCG TTGTTGGTAG TGTGAAAGAT TATCAATTAA TTCAAGTTTT ATCATCTCTA
GCACCTTCAC CATAA
 
Protein sequence
MLYKLPLFVA IKQRILLLKQ FPHISLLIWL IPLLLFTSGE TSLMAHDETL YARRARLIFD 
SGDWIAPWKT AHHKTPGFYW LIAIFYQLFG VSDTSARIPS MIAGILSILV IYEIAKILLY
QKLAYLSAAI LSVEFLWLQY CRLTAPDVPM ILLVCLAILC LLKAEIYPKY RVIYGFIVGV
SFGLGFLMRS FMICLPIAAL LPYLIWEHRR HRHLTNPMLY LGGLVGLIPT LVWLWFNWQR
YGGDSVGHLL GFVAELGSSK RPGNGILYYL WNVPLKSFPW CLFSILGLVL VIRKPISRYQ
LLLVGFPIFL FTEITIFSTR LMHYSLCLYP FIAMLAAVGL DWLAKIYQEG KANKINAHLP
RNISYSFGVL GIILIIAGIV VLVANLVDIK KYAIVALVTG LGCLIVPLIW ILRYSLHQNF
LTAPYWVSGW LITSWLSIAT AGSLGLLGDF NPAFRIFFHQ QSIVKILQNH PVSFVNLEGK
NAVLINFYTP IHGQEVKSVS QLPPLSYAWI YTPNSGNLVK SYRVVGSVKD YQLIQVLSSL
APSP
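
The protein sequence follows from the gene sequence by codon-wise translation. As a minimal sketch, the code below translates the first and last stretches of the gene sequence in plain Python. It assumes the codon assignments of NCBI translation table 11, which for sense and stop codons are the same as the standard code, and it does not model alternative start codons. The helper names `CODON_TABLE`, `clean`, `translate` and `gc_fraction` are illustrative and not part of the record.

```python
from itertools import product

# Codon assignments in TCAG order (standard code; table 11 uses the same
# sense/stop assignments and differs only in permitted start codons).
_AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), _AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence above, and its final 15 bases.
first_60 = "ATGTTGTACA AATTACCTTT ATTTGTCGCT ATTAAACAGC GAATTCTCCT ACTGAAGCAA"
last_15 = "GCACCTTCAC CATAA"

print(translate(first_60))  # compare with the start of the protein sequence
print(translate(last_15))   # compare with the end of the protein sequence
print(round(gc_fraction(first_60), 3))
```

The two translated fragments agree with the first 20 and last 4 residues of the protein sequence listed above. The GC fraction printed here covers only the first 60 bases, so it is not expected to equal the whole-gene GC content field.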