Gene Aazo_3745 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_3745 
Symbol 
ID9341550 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp3805535 
End bp3806692 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content41% 
IMG OID 
Productgroup 1 glycosyl transferase 
Protein accessionYP_003722411 
Protein GI298492234 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAATTG CCCTATTCAC CGAAACTTTT TTACCCAAGG TTGATGGGAT TGTTACCCGT 
CTACGTCACA CTGTTGATCA TCTCCAACGG GATGGAAATC AAGTCTTAGT ATTTGCCCCG
GAAGGTGGAA TTACAGAACA CAAAGGAGCG AAAGTTTTCG GAGTTAGTGG TTTTCCTTTA
CCTCTTTATC CAGAGTTAAA ATTAGCTCTG CCTCGTCCTG CCATTGGTCA TGCTTTAGAA
GAGTTTCAAC CGGATATTAT TCATGTTGTC AATCCTGCCG TTTTGGGATT ATCGGGTATT
TTTCATAGTA AAGTCTTAAA AATTCCTTTG ATCGCTTCTT ACCATACCCA TTTACCTCAA
TATCTACAAC ATTACGGTTT GGGGATGCTA GAAGGATTAC TATGGGAATT GCTTAAAGCT
GGACACAATC AAGCAGCCTT AAATTTGTGT ACCTCGACAG CGATGATAGA AGAACTCTCT
GAACATGGGA TTGAAAGATT AGATTTGTGG CAACGGGGAG TAGATACAGA ATTATTCCAT
CCTAATTTAG CCAGCGAGGA AATGCGATTA CACCTCACGC AAAATCATCC AAAAAGCCCC
TTGTTGCTGT ATGTTGGTCG TCTTTCAGCC GAAAAAGAAG TTGAACGCAT TAAACCCATC
TTAGAAGCCA TTCCTGATGC ACGATTGGCA TTAGTAGGAG ATGGACCAAA CCGCCAAAAT
TTAGAAAGGC ATTTTGCAGG TACAAATACT CATTTTGTTG GTTATCTGAT GGGTAAAGAG
TTGGGTTCAG CTTTTGCCAG TGCGGATGCT TTTATTTTTC CTTCCCGTAC AGAAACATTA
GGCTTAGTGC TACTAGAAGC AATGGCCGCA GGTTGTCCAG TAGTTGCAGC CCGTTCAGGT
GGCATTCCTG ACATTGTTAC AGATGGTATA AATGGTTATC TTTTTAACCC AAAAGCTGAT
ATTCAAGAGG CTATTGATGT TACTATCAAG TTGTTAAAAC AAAGACAAGA AATAGCGATT
ATCCGTAAAA ACGCCCATAC AGAAGCAGAA AAATGGGGAT GGGCTGCTGC TACACGACAA
CTACAAGATT ACTATCAAAA GGTAATAGGA GTCAGGAATC AGGAGTCAGG AGTCAGGAGT
CAGGAGAAGA TTCATTAA
 
Protein sequence
MRIALFTETF LPKVDGIVTR LRHTVDHLQR DGNQVLVFAP EGGITEHKGA KVFGVSGFPL 
PLYPELKLAL PRPAIGHALE EFQPDIIHVV NPAVLGLSGI FHSKVLKIPL IASYHTHLPQ
YLQHYGLGML EGLLWELLKA GHNQAALNLC TSTAMIEELS EHGIERLDLW QRGVDTELFH
PNLASEEMRL HLTQNHPKSP LLLYVGRLSA EKEVERIKPI LEAIPDARLA LVGDGPNRQN
LERHFAGTNT HFVGYLMGKE LGSAFASADA FIFPSRTETL GLVLLEAMAA GCPVVAARSG
GIPDIVTDGI NGYLFNPKAD IQEAIDVTIK LLKQRQEIAI IRKNAHTEAE KWGWAAATRQ
LQDYYQKVIG VRNQESGVRS QEKIH