Gene Aazo_1744 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_1744 
Symbol 
ID9339537 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp1806343 
End bp1807590 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content37% 
IMG OID 
Producttransposase IS4 family protein 
Protein accessionYP_003721003 
Protein GI298490826 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGAAA CTACTCCTGC AGCAATGCCA CCATGCTTTG AGAGATGGTG TCAAAGGTTT 
GATAATGTAT TTACTCACAA GGCGCAGAAA AGAGAGTTTA GGAATTATTT AGGGGGATTA
TTACGTGAAA GTGAGAGAAA AAACCTACTT CAAATGGCAG AGAATGCCCT AGGGGTGACC
TACCACCGAT TACACCACTT TTTAACTGAA GCACCTTGGT CCATTTCCCA AGTCAATGAC
CGTCGATTAG AGATTATGAA TAAGTGTAGT CAGACGAGAA TCACCAGAGG ATTTAGCTTA
ATAATTAATG ATTATGTCCA TAGAAAAAGC GGGAACTTGA GGGATGGAGT AGGAAGACAA
TATAATATTG GAGAAATTGG GAACACGGAT AATGGGATAG TAGTAGTAAC AACACATCTA
TATGATGGCA GTAAAAGCTT ACCATTAGAT ATAGAGTTAT ATCACCACGG TTATGATTCT
TTACCCAAAG GGAAAGAAGA ACCTCTATTT GAGAAGAAAC ATGAGTTAGG AATTAAATTG
ATAGACCTAA CGTTAAGCCG GGGTTATCAA CCAGGAATAG TAATTATAGA TGCTGCATAT
GGCAACAATA CATCTTTCTT ATTAAAGATA GAAAATCGGC ATTTAAAGTA TTTAGGAGGA
TTAGCTGGAA ATCGCAAAGT CCTTACCAGT GACCAAGAGG ATAGTCCACA AATAATTAGG
TTAGATGAAT TAGCACAAAG TTTACCCCAA ACGGCTTTTA CAGAAATTGA ACTGGAGTTA
GATAAAACCA AAACATTATG GGTAGTAACT AAAGAAGTAG AAATATTGGG CCTAAGTGGA
AAGCGGAATA TTGCTATTGT CATTGACGCT TCTACTGTCT CTCAAGCCAC TGATATTAAC
TACTTTATTA CCAATGTTTC TTCATCAGTT ATCACACCCC AGTGGATAGT CAATACATAT
TCTCAAAGAA ATTGGGTAGG AGTTTTCTAC AGGGAAGCCA AGGGATGGTT AGGACTCGAA
GAATATCAAG TTCGAGATAA CAGTAGTTTA CTGCGCCATT TTATTTTGGT TTTCTGTGCC
TACACTTTTA TTCTTTGGCA TCAGTTAACT GGAGGATTAA GACGAAGGTG GGCTAAGAAA
CCTTTGAATA CTTTTACTGA GGCTTTAGAA GCGTTTAGAA CAGCCATATC TTTTCTATTT
ATTGATTGGT TCAACTTGAA TCCGGACGTC TTTCCTTCTT ACTTATAA
 
Protein sequence
MKETTPAAMP PCFERWCQRF DNVFTHKAQK REFRNYLGGL LRESERKNLL QMAENALGVT 
YHRLHHFLTE APWSISQVND RRLEIMNKCS QTRITRGFSL IINDYVHRKS GNLRDGVGRQ
YNIGEIGNTD NGIVVVTTHL YDGSKSLPLD IELYHHGYDS LPKGKEEPLF EKKHELGIKL
IDLTLSRGYQ PGIVIIDAAY GNNTSFLLKI ENRHLKYLGG LAGNRKVLTS DQEDSPQIIR
LDELAQSLPQ TAFTEIELEL DKTKTLWVVT KEVEILGLSG KRNIAIVIDA STVSQATDIN
YFITNVSSSV ITPQWIVNTY SQRNWVGVFY REAKGWLGLE EYQVRDNSSL LRHFILVFCA
YTFILWHQLT GGLRRRWAKK PLNTFTEALE AFRTAISFLF IDWFNLNPDV FPSYL