Gene Aazo_5244 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_5244 
Symbol 
ID9343109 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014249 
Strand
Start bp7456 
End bp8556 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content45% 
IMG OID 
Productintegrase family protein 
Protein accessionYP_003723388 
Protein GI298501391 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTTATTT CCCAGATGGA TATTACTAAT GTTGAGGTAA GCAACGTAGA AAGTCTCCCT 
CCCTCCGCCA GTAGAGTTGT ATGTCATGAT ACTTCTGCAT CTGTTACCGC TGAATTAGTA
GAGGAGGTGA TTCGCCAATA TTATTACCTT ACACCTGCTG TAGCAAATGA TGAGGAGTTG
ATTTTGGCTT GGGCAGGGTC ACAGAACCGT GAGCAAACTA AGCGAAAATA CTATCGCTTT
GGTCAAAAAT TACTCACTTG GTTGAAAAAT CAAGGTGTAC GAGATTTACG GCTGGTTCAG
GCACCAAAGT TACTGGAATT TATTGCTAGT TGGGGTGAGG TTTCTCCTTA TACCAAATCT
AACCAGGTGC TAATATTGCG ATCACTATGG AGCTATGGTC ATGGGGAGAA TGTTGGCTAT
TTCTTACGGA ATATTACTAG TAGTATAGAT TACGATAATT TCAGTGACTT ACCAAAGGCA
GAGCGTTATT TAGAAGATTG GGAGATGGCA CAGTTGGCTG ATGTTGCCCA ACGATTGAGG
GAGCAGTACT GGCTAGTTTT TTCTTTGCTT TTTTATAGTG GGATGCGGGT GGGTGAGGTT
GGTCGGGTGA CGGTTCCTGG TGATAAACCG GGTCAGCCGA AGGAAGATTA TCCGGGTTTA
TATTGGCACA ATTTTAAATG GCAGCCCGAC CCGATACCAG AAGATAGTTC CAGGGGATAT
TACACAATTA AGTTTCGGGG CAAGGGGGGA AAGTACCGGG AAATTGGTTT GGATCACGAA
ACTTCACGGA TCTTTAAGAA GTACCGGGGG ATGGCAGGTG AAAAGATGCC AGTGTTTCCG
AATATGTCAC CTAACCCGAA AAAGCGGGGT TTACCGTTGA GTGACCGGGC AATTAAAAGG
TTGATTCAGG ATATATCTGA GGTGGCGAAG GTAAAGTTTT CTTGCCACTG GTTACGGCAT
TCTCACGCAT CGCGGGCGGT GGATAGTAAA TCACTGTTTG AGGTGCAAGA CCAGTTAGGG
CATAGTAAGA GCGATACTAC TAAGACCTAT GTTCGTTCTA AAAAGGATGC GGGAACGGGG
ACTGTATTAC CGAGGTTTTG A
 
Protein sequence
MVISQMDITN VEVSNVESLP PSASRVVCHD TSASVTAELV EEVIRQYYYL TPAVANDEEL 
ILAWAGSQNR EQTKRKYYRF GQKLLTWLKN QGVRDLRLVQ APKLLEFIAS WGEVSPYTKS
NQVLILRSLW SYGHGENVGY FLRNITSSID YDNFSDLPKA ERYLEDWEMA QLADVAQRLR
EQYWLVFSLL FYSGMRVGEV GRVTVPGDKP GQPKEDYPGL YWHNFKWQPD PIPEDSSRGY
YTIKFRGKGG KYREIGLDHE TSRIFKKYRG MAGEKMPVFP NMSPNPKKRG LPLSDRAIKR
LIQDISEVAK VKFSCHWLRH SHASRAVDSK SLFEVQDQLG HSKSDTTKTY VRSKKDAGTG
TVLPRF