Gene Aazo_4736 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_4736 
Symbol 
ID9342543 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp4840822 
End bp4842213 
Gene Length1392 bp 
Protein Length463 aa 
Translation table11 
GC content42% 
IMG OID 
Productdiaminopimelate decarboxylase 
Protein accessionYP_003723053 
Protein GI298492876 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.775184 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTATCGA CTCACCCCCT CGGGATTCAA GCTTCTGGCA GTCAATATTT ACCTCAAAAG 
CTCAACAACA CAACTCTTTC ACCTAATCAA GAACTCTTAC CCTTGAGTGC TAGAGTGAAT
CGTCATGACT CCCTAGAAAT CGGTGGGTGT GATGTCACAA CGCTGGTTGA GCAGTTTGGT
TCACCTTTAT ATATTTTAGA TGAAGAAACT CTACGGACAG CCTGTAGGCA ATACCATGAT
ACTTTCAAAC AACACTACAA AGACGAATCT CAAGTATTGT ACGCTTCTAA AGCATGGAAT
TGTCTAGCTG TTTGTGCCAT TGTTGCCTCG GAAGGCTTGG GAATTGATGT GGTATCTGGT
GGTGAACTAT ACACTGCGCT CAATGCGGGT GTGACTCCAG ATAAAATCTA CCTACATGGC
AATAATAAAT CTCGTGATGA GCTAGTTTTA GCTATTGAGT CAGGTTGTAC AATTGTTGCG
GATAACTGGT ACGAATTAAA AACTTTGGTA GAATTGGTAA CAAAGTCTTC CCCAGTTCGC
ATTATGTTGC GGTTAACTCC AGGGATTGAA TGTCATACCC ACGAATATAT CCGCACCGGA
CATTTAGACA GTAAATTTGG TTTTGATCCC AGTGATTTAG ATGAGGTATT TGCCTTTGTT
AGCAAACAAC CAAGTTTAAA CTGTGTAGGG TTACACGCTC ACATAGGTTC ACAAATTTTT
GAACGTCAAC CCCATCGAGA TTTGGCTGCT GTGATGGTAC AGTGGTTGCG GGACGCAGCC
AAATATAATT TGGAATTGAA AGAGTTAAAT GTAGGTGGTG GTTTAGGGAT TAAGTACACA
GAATCAGATG ATCCCCCAAG CATTGAAGAA TGGTCAAAGG CAATTTGTGA AGTAGTTCAA
CAAGCTTGTG CTGCTGAAAA TTTGCCCTTA CCTAAATTAC TCTGTGAACC AGGGCGATCG
CTGATTGCCA CAGCTTGCGT TACCGCCTAC AGTATTGGTT CAGCTAAAGT TATTCCTGAT
CTTCGTACTT ATGTGACAAT TGATGGAGGA ATGTCTGATA ATCCCCGCCC CATCACCTAC
CAATCAGTTT ATCGGTCAGT GGTTGCTAAT AAAATGTCTG CTGCTTTAAC AGAAACAGTC
ACATTGGCTG GTAAACATTG CGAATCAGGA GATATTCTGA TCAAAAATGC CCAACTGCCT
AAAACTGAAC CAGGTGATAT TCTCGTCGTT ATGGGAACTG GTGCCTACAA TTACAGTATG
GCATCTAACT ACAACCGCTT GCCCCGACCA GCAGCTGTTT TAGTGGCGAA TGGCGAAGCA
AACTTAATTT TGCAACGCGA AAATTATCAA GACATAATTC GACAAGATTG CCTACCAGAA
AGACTGAAAT AG
 
Protein sequence
MVSTHPLGIQ ASGSQYLPQK LNNTTLSPNQ ELLPLSARVN RHDSLEIGGC DVTTLVEQFG 
SPLYILDEET LRTACRQYHD TFKQHYKDES QVLYASKAWN CLAVCAIVAS EGLGIDVVSG
GELYTALNAG VTPDKIYLHG NNKSRDELVL AIESGCTIVA DNWYELKTLV ELVTKSSPVR
IMLRLTPGIE CHTHEYIRTG HLDSKFGFDP SDLDEVFAFV SKQPSLNCVG LHAHIGSQIF
ERQPHRDLAA VMVQWLRDAA KYNLELKELN VGGGLGIKYT ESDDPPSIEE WSKAICEVVQ
QACAAENLPL PKLLCEPGRS LIATACVTAY SIGSAKVIPD LRTYVTIDGG MSDNPRPITY
QSVYRSVVAN KMSAALTETV TLAGKHCESG DILIKNAQLP KTEPGDILVV MGTGAYNYSM
ASNYNRLPRP AAVLVANGEA NLILQRENYQ DIIRQDCLPE RLK