Gene Aazo_5007 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_5007 
Symbol 
ID9342814 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp5127015 
End bp5128364 
Gene Length1350 bp 
Protein Length449 aa 
Translation table11 
GC content45% 
IMG OID 
Product3-phosphoshikimate 1-carboxyvinyltransferase 
Protein accessionYP_003723243 
Protein GI298493066 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGGCTG CTGTTATAAT TTTAGAAACT CAAGAAAACG CTTCTCATAA CTTAATTATT 
CAGCGACCCC CTGCTGGTCT GTCTCTACAG GGTCGCATCA GAGTTCCTGG GGATAAATCT
ATTTCCCATC GGGCTTTGAT GTTGGCTGCG ATCGCTGAAG GTGAAACTCA GATTCAAGGA
CTGCTTTTAG GTGAAGACCC CCGCAGCACT GCTAACTGTT TCCGGGTGCT TGGTGCGGAA
ATTTCTGAAT TAAATACGGA ATTTGTGCGG GTAAAAGGGA TTGGACTGGG GAATTTTCAA
GAACCTTTTA ATGTGTTGAA CGCAGGTAAC TCTGGTACAA CCATCAGATT GATGTTAGGC
CTTTTAGCTT CCCATCCAGG GCGATTCTTT ACAGTTACAG GTGATGATTC GTTGCGAGCG
CGTCCTATGT CCCGTGTTGT CAAACCTTTA CAACAAATGG CAGCAGAAAT TTGGGGACGC
AAAGGTAACT CGTTAGCACC CTTAGCAATT CAAGGACAAG CCCTCAAACC TATTCATTAT
CATTCTCCTA TCGCTTCGGC GCAAGTGAAA TCCTGTATTT TGCTTGCAGG TTTAAACACC
GAAGGTCAAA CTACCTTCAC CGAACCCGCT TTATCAAGGG ATCACAGTGA ACGGATGTTA
CGAGCTTTTG GGGCAGAATT GAGTATAGAC CCAGAAACCA ATAGTGTTAC CGTGACTGGT
AATGCTAAAC TCTACGGTCA AAAAGTGATT GTTCCCGGAG ATATTAGTTC AGCCGCTTTT
TGGTTAGTTG CAGGGGCTAT TGTTCCCGGT TCTGATTTGG TTGTGGAAAA TGTCGGTGTG
AATCCCACCC GCACCGGAGT TTTAGAAGCT TTATCAATGA TGGGAGCAGA TATTGAACTG
GAAAATCAGC GAGAAGTTGC TGGGGAACCG GTCGCAGATT TGCGGGTGTG TTCTAGTCAG
TTAAAAAGTT GTACCATTGC AGGGGATATT ATACCCAGAT TAATTGATGA AATTCCCATT
TTGGCGGTAG CTGCGGCTTT TGCCCAAGGG ACAACCATTA TTCGAGATGC AGCAGAGTTG
CGAGTCAAAG AGAGCGATCG CATTACTGTG ATGGCACAAC AACTCAATCA AATGGGAGCA
AAAGTGACGC AATTACCTGA TGGAATGGAG ATTACTGGTG GTACTCCTTT GATGGGTGCT
GAGGTAGACA GTTATACAGA TCATCGGATA GCTATGAGTT TAGCGATCGC TGCTCTTAAT
GCTAGTGGAA CTACTACTAT TCACCGTGCA GAAGCAGCAG CTATTTCTTA TCCCAATTTT
ACCAACACTT TAGTAGAAGT TTGTCGTTAA
 
Protein sequence
MSAAVIILET QENASHNLII QRPPAGLSLQ GRIRVPGDKS ISHRALMLAA IAEGETQIQG 
LLLGEDPRST ANCFRVLGAE ISELNTEFVR VKGIGLGNFQ EPFNVLNAGN SGTTIRLMLG
LLASHPGRFF TVTGDDSLRA RPMSRVVKPL QQMAAEIWGR KGNSLAPLAI QGQALKPIHY
HSPIASAQVK SCILLAGLNT EGQTTFTEPA LSRDHSERML RAFGAELSID PETNSVTVTG
NAKLYGQKVI VPGDISSAAF WLVAGAIVPG SDLVVENVGV NPTRTGVLEA LSMMGADIEL
ENQREVAGEP VADLRVCSSQ LKSCTIAGDI IPRLIDEIPI LAVAAAFAQG TTIIRDAAEL
RVKESDRITV MAQQLNQMGA KVTQLPDGME ITGGTPLMGA EVDSYTDHRI AMSLAIAALN
ASGTTTIHRA EAAAISYPNF TNTLVEVCR