Gene Aazo_4466 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_4466 
Symbol 
ID9342268 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp4557172 
End bp4558929 
Gene Length1758 bp 
Protein Length585 aa 
Translation table11 
GC content44% 
IMG OID 
Productsurface antigen (D15) 
Protein accessionYP_003722890 
Protein GI298492713 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGTCTCAAC CAACATTAAA CAAGTTGTTA AATTTCATCT TTAAACTAAA ACTTCTAAAA 
CTGGGCTTGA TTCTAGCAGC GGTATTATGC AGCTACCATC AAGCAAACGC TCAAACTTCC
AATTCTCCGA CTTCACCATC TGGACGCTTA GAAGATATTC CTGTTGCTCC CTTACTGTCG
GATGTGTTAC CTCAACCATC AGATCAGGAG CAATTACTTC CACTTGTGCC ACTACCGGAA
CAGCCAGCAC CAGGAGAGGA TGATGCTAAT ACCAAATTCC AGGTTGATCG CATTAAAGTT
GTCGGTAGTA CAGTTTTCAC ACCAAAACAG TTTGCGGCGA TTACATCTCC CTTTGTCGGA
CGGGAGGTCT CCTTTGTAGA GATATTGCAA ATTAAAGATG CCATCACTAA GCTATACACT
GACCATGGTT ATGTCACAAC AGGGGCTTTA ATTACACCAC AGACATTAGA AGCTGGAGTC
CTTAAACTTC AGGTGGTAGA GGGCAGTTTA TCAGAAATTA AAATCGTTGG TAATCGGCAA
TTGCGTAGTC AATACATTCG CGAGCGCATC CGAATAGGTG CTGGTAAACC CTTGAATATA
CCGCGCTTAA TCGAAAAGCT GCAACTCCTC CGCCTCGATC CGCGCATTCA AAACTTGTCA
GCTGAGTTAC AGATGGGTGT GCATCCCGGC AGCAATATAT TGCAAGTAGA GGTTCAAGAA
GCTGACACTG TCTCACTCAT AACAACTTTA GATAATGGGC GATCACCTAG TGTCGGTAGT
TTTCGTCAGG GTGTGGATTT CAAAGAAGCC AATTTACTAG GTTTAGGGGA CACGCTCAGT
GTGGGATACA CTAATACCGA TGGCAGTGAT ACAATCAATC TGAATTACAA ACTACCGATT
AACGCTGGCA ATGGTACTGT CTGGTTTGGT TTTAACCAAG GATGGAATCA TGTAATTGAA
GAACCGTTCA GCATCCTTGA TATTCAATCA CACACTAGGT CTTACGAATT CGGCTATCGG
CAACCACTGA TACAAAAGCC AACCCAGGAA TTGGCAGTGG AACTATCGTT ATCACACCAA
GAAAGCCACA CTGAGTTAGG TCTTAATGAT ATTGGCGGGT TTCCTTTATA CGTCGGTGCA
GATGCGGACG GAAAAACTAA GATTTCTGTC CTGCGTTTTA CTCAGGAGTA TAACCAACGC
AGTAATCAGC AAGTTATTGC AGTGCGATCG CAATTTAGTT TGGGGGTAGA TTGGTTCGAG
GCTAATGTCA ATGAAGATAA GCCAGATAGC CGCTTTTTCG CGTGGCGAGG ACAGGCACAG
TGGGTGCAAC AGTTAGCACC AGACACTCTG TTTTTAGCTA AGGGCGATTT ACAACTAGCA
GCAGATACTG TCGTGCCTTT CGAGCAATTT GGCATTGGTG GACAACTAAG TGTACGAGGC
TATCGCCAAG ATACATTACT TACTGACAAT GGCATATTAT TTTCAGCTGA ATTTCGGTTG
CCAATTTTGC ATACCTCCAA TCTGGGAGGA TTGCTACAAC TGACCCCATT CATCGACGTG
GGTCAGGGCT GGAATACTAA GGGTGACAAT CCATCACCTA GTATGTTAGT TGGTACTGGG
TTAGGACTAC TGTGGAAACA GAGTAATAAC TTCTCAACCC GTCTGGATTG GGGTATTCCT
CTCACATCAG TAGATAGCGA AAAGCGCTCG CTTCAGGAAA ATGGATTGTA CTTCTCCGTG
CAGTATTCGC CATTTTGA
 
Protein sequence
MSQPTLNKLL NFIFKLKLLK LGLILAAVLC SYHQANAQTS NSPTSPSGRL EDIPVAPLLS 
DVLPQPSDQE QLLPLVPLPE QPAPGEDDAN TKFQVDRIKV VGSTVFTPKQ FAAITSPFVG
REVSFVEILQ IKDAITKLYT DHGYVTTGAL ITPQTLEAGV LKLQVVEGSL SEIKIVGNRQ
LRSQYIRERI RIGAGKPLNI PRLIEKLQLL RLDPRIQNLS AELQMGVHPG SNILQVEVQE
ADTVSLITTL DNGRSPSVGS FRQGVDFKEA NLLGLGDTLS VGYTNTDGSD TINLNYKLPI
NAGNGTVWFG FNQGWNHVIE EPFSILDIQS HTRSYEFGYR QPLIQKPTQE LAVELSLSHQ
ESHTELGLND IGGFPLYVGA DADGKTKISV LRFTQEYNQR SNQQVIAVRS QFSLGVDWFE
ANVNEDKPDS RFFAWRGQAQ WVQQLAPDTL FLAKGDLQLA ADTVVPFEQF GIGGQLSVRG
YRQDTLLTDN GILFSAEFRL PILHTSNLGG LLQLTPFIDV GQGWNTKGDN PSPSMLVGTG
LGLLWKQSNN FSTRLDWGIP LTSVDSEKRS LQENGLYFSV QYSPF