Gene Aazo_1603 details

Gene Information

Locus tag: Aazo_1603
Symbol:
ID: 9339395
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: 'Nostoc azollae' 0708
Kingdom: Bacteria
Replicon accession: NC_014248
Strand:
Start bp: 1676138
End bp: 1677544
Gene Length: 1407 bp
Protein Length: 468 aa
Translation table: 11
GC content: 40%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003720904
Protein GI: 298490727
COG category:
COG ID:
TIGRFAM ID:
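
The coordinates and lengths reported above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not state this), and that the protein length excludes the terminal stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1676138
end_bp = 1677544
reported_gene_length = 1407      # bp
reported_protein_length = 468    # aa

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

assert gene_length == reported_gene_length
assert gene_length % 3 == 0  # a whole number of codons
assert protein_length == reported_protein_length
```

Under these assumptions the three reported figures agree: 1407 bp is 469 codons, one of which is the stop codon.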


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACACCA GATTAAAAAT GGCTTTGCTT CTTCACTCTG GAGAATCAGG ATTTGCTATA 
GTAATCGCTG TAGCACTGGG ATTAATTATG ATCTTAGTGG CGTTAACCAT GACGATAAGA
TCTCAAGGAG ATCAAATATT AGCGTCAACC CGAAAAGAAA CAGAGCGGTC ACTGGCAGCC
GCAGAAAAAG GTGTTTCCTA TTACCAAGCA TTCCTCAACT CTAACCGGCT ACTGCCCAGA
TATCCGGATT GTACTCAAGA TCGTACATCT TCTGGTACTT GTCCTGATTC TGGATCTCAA
AAAAGCTGGT CAAATCCCTC AGCTATTCCC GGAATGAGTG ACAGCCCTTG TACTGGTAGT
TCAGCATCCA CAGCAACAAT ACAAGGAAAT GCCGACACAA ACCAGTGGAA TCTGGTTGAT
ACAAATGACT CAAGCAAGGG ACAATATAAA CTGGTTTCTT ATAAAATTGC TGATTCTGGA
GATGATACTT TGCAGGCAAT GGGCATATTA ACCATTGAGG GAAGAATAAC TAATTCAACA
GCTAACAGCA AAGCTAATAA ATCTATTAGT AGGGTACAGG TAGCTATTCC TGTTAACTTA
CCTAGTATTA ATAGTGTTCC AATTCCTGGA GTATGGATTG GTGATTCTAC TACTAATAGT
GGTACAGGTG GTAATACAAT CCAAGGTAAT GTATTAGTTA ATAGTTGTAA TGTCACCCTT
TCAGATATCG AAATCGATAG GAGCACTCCA CAGTATTCGG CAATGTATAC AAATTTGAGA
ATGCCATCAG TCCCAACAAT GCCAGAAGCG GCAAATAATT CTGTCACTCC ACGGGTTGCA
GGTACTATTT CCTTGGGAAC AATAAATACT GATACCACTC TCCCTCGGCT CACTGGTGAT
ACACCTGATT TGCCTATAAC ATTTAACGGG CAATCAAGAT ATGTATATTT AGCGACTGAT
ATAGTCAGAA GTGGGGGTTC AACAGCATTG ACAATTACAC CAGGGAAAAA AGTAGTTATA
TTTTTATCTG GCAATACAAG TAAAAACGTT GATATTTATC ATGAATGTGG TAGTGTTAGC
GGTTGTCTAC CTACCGATTT TCAAATTTTT GGCACTAAAC CTAGTGGTGG TGAAATATGT
CTAAATGGGA ATCATCTGCT AGATGCTTTT ATCTTAGCAC CTACTTATAC AGTCGGGGTT
GCAGGGGGAG GCAATAGTGG GGGCATAAAT GGTTCTATTT GGGCAAACCA GTGGAGTAAT
GATTCAGGTT GTGGTTCTAA CTCTAACAAC GTAGTTGTTA GACAATCAGC AAATTGGAGT
GAATTAACTG GGCTCCAACC AGATAGTAGT GAATTACCAC TTTCAATTAA ATCTATAAGG
TCTTGGAAAC GAAATGTGGT GAATTAA
 
Protein sequence
MNTRLKMALL LHSGESGFAI VIAVALGLIM ILVALTMTIR SQGDQILAST RKETERSLAA 
AEKGVSYYQA FLNSNRLLPR YPDCTQDRTS SGTCPDSGSQ KSWSNPSAIP GMSDSPCTGS
SASTATIQGN ADTNQWNLVD TNDSSKGQYK LVSYKIADSG DDTLQAMGIL TIEGRITNST
ANSKANKSIS RVQVAIPVNL PSINSVPIPG VWIGDSTTNS GTGGNTIQGN VLVNSCNVTL
SDIEIDRSTP QYSAMYTNLR MPSVPTMPEA ANNSVTPRVA GTISLGTINT DTTLPRLTGD
TPDLPITFNG QSRYVYLATD IVRSGGSTAL TITPGKKVVI FLSGNTSKNV DIYHECGSVS
GCLPTDFQIF GTKPSGGEIC LNGNHLLDAF ILAPTYTVGV AGGGNSGGIN GSIWANQWSN
DSGCGSNSNN VVVRQSANWS ELTGLQPDSS ELPLSIKSIR SWKRNVVN