Gene Aazo_4456 details


Gene Information

Locus tag: Aazo_4456
Symbol:
ID: 9342258
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: 'Nostoc azollae' 0708
Kingdom: Bacteria
Replicon accession: NC_014248
Strand:
Start bp: 4541507
End bp: 4542880
Gene Length: 1374 bp
Protein Length: 457 aa
Translation table: 11
GC content: 45%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003722884
Protein GI: 298492707
COG category:
COG ID:
TIGRFAM ID:
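
The start, end, and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function name `cds_lengths` is hypothetical, and it assumes 1-based inclusive coordinates, an unspliced CDS, and a gene length that includes the terminal stop codon.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and that the CDS ends with a
    stop codon, which does not contribute an amino acid.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # subtract the stop codon
    return gene_len, protein_len


# Coordinates taken from the record above.
gene_len, protein_len = cds_lengths(4541507, 4542880)
print(gene_len, protein_len)  # 1374 457
```

Under these assumptions the result agrees with the listed gene length (1374 bp) and protein length (457 aa).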


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCAATCG GTTTTCTCAG ACAAGCTATC AACGCCCTAC AACTCCAGTC TCATGGACGC 
ACTTCCCATC GGGTCAACCA GTGGTTTAAG TGGTTATCAC CTGGATTATC AGTAAAACGC
TGGTTTTTCG TCAGTGTTGG GGGTTTTTTA CTGGCAAGTT TGGGGTTGGC TATTTGGATT
AAGCTAACCC CGATTTTCTG GATATTAGAG TTTCTCAGAG GTTTGCTGGG TTTCCTCACG
GACATACTAC CCAACTATAT CAGCGGACCT TTGGTTTTAC TGTGCGGTAT CTTACTGCTG
CTGTGGGGAC AATCCCGCAC CGTAGGTTCA ATTACTGAAG TGCTAAGACC ACAGGGGGAT
GAGGAAGAAC TGATAGATGT TTTGCTGGCA CATCGCCGTT TATACCGGGG TCCGAAAATT
GTCGTCATTG GTGGCGGTAC TGGACTGTCT ACTTTACTCA GGGGCTTAAA AACCTACAGT
GCTAATATTA CTGCTATTGT TACCGTGGCT GATGATGGTG GTTCTTCTGG CAGGTTGCGT
CAGGAATTTG GCGTTTTACC TCCTGGGGAT ATTCGCAATT GTTTGGCTGC ACTAGCTGAT
GAAGAAAAGT TATTAACAGA ATTGTTTCAA TATCGTTTTC GCGCAGGAGA TGGGTTGACA
GGTCACAGTT TTGGTAACTT GTTTTTAACT GCCATGACTG ATATTACTGG AGATTTAGAA
AGGGCAGTTG CAGCTAGTTC CAAAGTTCTT GCCGTGAGGG GACAAGTTTT ACCCGCAACC
CTCAGTGATG TTCGTCTTTG GGCAAAATTA GAAGATGGGC GCCGGATTGA AGGTGAGTCC
AGCATTCCCA AAGCTGGGGG AAAAATTGTT CAAATTGGCT GTATTCCTGA AAATCCTCCC
GCCTTACCCG CAGCGATTAA AGCAATTAAA GAAGCTGATT ACATTATTAT TGGACCGGGC
AGTTTGTATA CTAGTCTAAT ACCTAATTTA TTAGTACCAG AAATTGCCGA TGCGATCGCA
GCCCAAAATA TTCCCCGTAT CTATATCTGC AATATCATGA CCCAACCGGG AGAAACAGAA
GGATACACCG TAGGCGAACA CATCCAAGCC ATTGATAAAG CTTGTGGCGA CAGAAGGCTG
TTTGATGCCG TACTAGTACA TAAAAAAACC CCATCAGCCC AAGCCCTCAT TCGCTACGCC
CAGCAAAATT CCCATCCCGT TTTCCTAGAC CGAGAAACCG TCATCAAACT AGGAAGAAGA
ATAGTCCCCT CCAACATCTT GTATGAAGAC GAAACCGGAT TTGTTCGCCA CGACCCACAA
AAACTAGCCA AGGTTTTATT GAAATGGTAT AATGGAGCGC AGCATGGGAA GTAA
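
The gene sequence is printed in space-separated blocks of 10 bases, so it needs to be cleaned before any per-base statistic such as GC content is computed. The following is a minimal sketch; the helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the sequence is used as sample input. How the record's own "GC content" figure was computed or rounded is not stated, so no value for the full gene is claimed here.

```python
def clean_sequence(text):
    """Remove the whitespace used for 10-base grouping and uppercase the bases."""
    return "".join(text.split()).upper()


def gc_percent(seq):
    """Return the percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, used as a small sample.
first_line = "ATGTCAATCG GTTTTCTCAG ACAAGCTATC AACGCCCTAC AACTCCAGTC TCATGGACGC"
seq = clean_sequence(first_line)
print(len(seq), gc_percent(seq))
```

To process the whole gene, pass all sequence lines joined together to `clean_sequence` before calling `gc_percent`.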
 
Protein sequence
MSIGFLRQAI NALQLQSHGR TSHRVNQWFK WLSPGLSVKR WFFVSVGGFL LASLGLAIWI 
KLTPIFWILE FLRGLLGFLT DILPNYISGP LVLLCGILLL LWGQSRTVGS ITEVLRPQGD
EEELIDVLLA HRRLYRGPKI VVIGGGTGLS TLLRGLKTYS ANITAIVTVA DDGGSSGRLR
QEFGVLPPGD IRNCLAALAD EEKLLTELFQ YRFRAGDGLT GHSFGNLFLT AMTDITGDLE
RAVAASSKVL AVRGQVLPAT LSDVRLWAKL EDGRRIEGES SIPKAGGKIV QIGCIPENPP
ALPAAIKAIK EADYIIIGPG SLYTSLIPNL LVPEIADAIA AQNIPRIYIC NIMTQPGETE
GYTVGEHIQA IDKACGDRRL FDAVLVHKKT PSAQALIRYA QQNSHPVFLD RETVIKLGRR
IVPSNILYED ETGFVRHDPQ KLAKVLLKWY NGAQHGK
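
The protein sequence can be related to the gene sequence by translating codons in frame from the first base. The sketch below is a hand-rolled illustration, not the database's method: `CODON_TABLE` and `translate` are hypothetical names, and the table uses the standard codon-to-amino-acid assignments on the assumption that translation table 11 differs from the standard code only in which start codons it permits. This gene starts with ATG, so start-codon handling is not needed here.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the standard amino acid string.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna):
    """Translate DNA in frame from the first base, stopping at a stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence above (both start in frame).
first_line = "ATGTCAATCG GTTTTCTCAG ACAAGCTATC AACGCCCTAC AACTCCAGTC TCATGGACGC"
last_line = "AAACTAGCCA AGGTTTTATT GAAATGGTAT AATGGAGCGC AGCATGGGAA GTAA"
print(translate(first_line))  # MSIGFLRQAINALQLQSHGR
print(translate(last_line))   # KLAKVLLKWYNGAQHGK
```

Both outputs match the corresponding ends of the listed protein sequence, and the final TAA codon is consumed as the stop.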