Gene Aazo_4706 details

Gene Information

Locus tag: Aazo_4706
Symbol:
ID: 9342513
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: 'Nostoc azollae' 0708
Kingdom: Bacteria
Replicon accession: NC_014248
Strand:
Start bp: 4807935
End bp: 4809275
Gene Length: 1341 bp
Protein Length: 446 aa
Translation table: 11
GC content: 40%
IMG OID:
Product: sun protein
Protein accession: YP_003723032
Protein GI: 298492855
COG category:
COG ID:
TIGRFAM ID:
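
The start, end, gene length, and protein length fields above are related by simple arithmetic. The sketch below checks that relationship. It assumes the coordinates are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon. The function name `cds_lengths` is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene_length_bp, protein_length_aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon is the stop codon and does not encode an amino acid.
    protein_length = gene_length // 3 - 1
    return gene_length, protein_length


gene_length, protein_length = cds_lengths(4807935, 4809275)
print(gene_length, protein_length)  # 1341 446
```

Both values agree with the Gene Length and Protein Length fields listed in the record.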


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACTAATC CCCGCCAACT TGCCTTTATT GCCCTCAAAG AAGTACATAA AGGGGCTTAT 
GCTGATGTAG CTCTAGACCG TGTGCTACAA AAGTTTAAAT TACCCGACAA TGATCGTCGT
TTAATGACAG AATTAGTCTA TGGTAGTGTT AGAAGACAAC GCACTCTAGA TACTCTAATT
GATAAATTAG CTACAAAGAA GGCACACCAA CAACCACCAG AACTTCGTAC TATTTTACAT
CTCGGTTTTT ATCAATTGCG TTATCAAGAA AAGATCCCTG TTTCTGCTGC TGTGAATACC
ACAGTTGAAC TAGCAAAGGA AAATGGCTTT TCTAGTTTAA CTGGTTTTGT GAATGGTTTG
TTACGTCAAT ATCTACGTCT TATAGAAAGT TCATCAGAAC CATTAAAGTT ACCAGAAAAT
CCGGTAGAGA GATTGGGAAT TTTACACAGT TTTCCTGATT GGATAATTGA GGTGTGGTTA
GAACAATTGG GTCTTAAAGA AACAGAAAGA CTCTGTGCAT GGATGAATAA AACCCCAACT
ATTGATTTAC GGGTAAATAT CCTTCGCAGT TCCCTGGAAA AAGTGGAATC AGCTTTTAAA
TCTGCTGGTG TTTTAGTTAG ACCTATTCCC TATTTACCTC AAGGTTTAAG ATTAATTAGT
AGTACCGGGC CAATTAAAAA TTTACCTGGT TTCCGAGAAG GTTGGTGGAC TGTTCAAGAT
AGTAGCGCCC AATTAGTTAG TCATTTGCTT GACCCAAAAC CGGGCAATGT GGTGATTGAT
GTTTGTGCGG CTCCAGGGGG AAAAACCACC CATATTGGTG AGTTAATGGG AGATAAAGGT
AAAATCTGGG CTTGTGATCA AACTGCTTCC CGGTTACGTA GACTCAAGGA AAATGTCCAA
CGTCTACATT TAGAATCTAT CGAAATCTGT ACAGGGGATA GCCGCAATTT GACCCAATTT
AACAACATTG CTGATTGTGT ATTATTAGAT GCACCTTGTT CCGGTTTAGG AACTATGCAC
CGCCATGCTG ATGCACGTTG GCGACAAACA CCGTCTTCTG TTCAAGAACT CTCCCAACTA
CAGAAAGAAC TGATATCACA TACAGCTAAT TTTATCAAGG TTGGAGGGGT TTTAGTTTAT
GCCACTTGTA CACTCCATCC CATGGAGAAT GAAGAGGTAA TTTCTCAATT TTTAGCTGTA
AATCCCCATT GGCAAATTGA ATCTCCTGGC TCGGATTTAG TTGATATTGC TTCTCCAGGG
TGGTTAAAAG TCTGGCCTCA TCAACGGGAT ATGGATGGTT TTTTCATGGT GCGCTTAAGA
AAAACCAAGG ATTCCGAGTG A
 
Protein sequence
MTNPRQLAFI ALKEVHKGAY ADVALDRVLQ KFKLPDNDRR LMTELVYGSV RRQRTLDTLI 
DKLATKKAHQ QPPELRTILH LGFYQLRYQE KIPVSAAVNT TVELAKENGF SSLTGFVNGL
LRQYLRLIES SSEPLKLPEN PVERLGILHS FPDWIIEVWL EQLGLKETER LCAWMNKTPT
IDLRVNILRS SLEKVESAFK SAGVLVRPIP YLPQGLRLIS STGPIKNLPG FREGWWTVQD
SSAQLVSHLL DPKPGNVVID VCAAPGGKTT HIGELMGDKG KIWACDQTAS RLRRLKENVQ
RLHLESIEIC TGDSRNLTQF NNIADCVLLD APCSGLGTMH RHADARWRQT PSSVQELSQL
QKELISHTAN FIKVGGVLVY ATCTLHPMEN EEVISQFLAV NPHWQIESPG SDLVDIASPG
WLKVWPHQRD MDGFFMVRLR KTKDSE
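
The protein sequence can be derived from the gene sequence with translation table 11, and the GC content can be recomputed from the same string. The following is a minimal sketch, not the database's own pipeline. It assumes that table 11 assigns the same amino acids to codons as the standard code (the two differ in which start codons are permitted), and it does not handle alternative start codons such as GTG or TTG, which does not matter here because this gene begins with ATG. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence above.
print(translate("ATGACTAATC CCCGCCAACT TGCCTTTATT"))  # MTNPRQLAFI
```

Passing the full gene sequence to `translate` and `gc_percent` allows a direct comparison with the listed protein sequence and the GC content field.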