Gene Aazo_4888 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_4888 
Symbol 
ID9342695 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp5002991 
End bp5004190 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content43% 
IMG OID 
Productargininosuccinate synthase 
Protein accessionYP_003723149 
Protein GI298492972 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGTCGCG CCAAAAAGGT TGTATTAGCA TATTCTGGTG GAGTTGATAC TTCTGTTTGC 
ATACCCTACT TGAAAAAAGA GTGGGGAGTT GAAGAGGTAA TTACCCTAGC AGCAGATTTA
GGTCAGGGAG ATGAATTAGA ACTAGTGCGA GAAAAAGCTC TCAAATCTGG TGCAAGTGAA
TCCCTGGTAG CGGATGTCAA AAAGAGTTTC GTGACAGAGT ATGCATTTCC CGCAATTCAA
GCCAATGCTC TGTATGAAAA TCGCTATCCT CTAGGAACAG CCCTAGCTAG ACCTTTAATT
GCTCAGATTC TGGTAGAAAC AGCTCAAAAA TACGGTGCTG ATGCGATCGC TCACGGTTGC
ACAGGTAAAG GTAACGACCA AGTACGTTTT GATGTTTCCT GTACAGCCCT CAATCCCAAT
CTGAAAATTC TTGCCCCAGC TAGAGAATGG GGAATGAGTC GAGAACAAAC CATAGCTTAC
GGTGAACAAT TTGGTATTCC TGCACCCGTG AAAAAATCCT CTCCCTTCAG TATAGATAAA
AACCTGCTTG GTCGCAGTAT TGAAGCTGGT ACGTTGGAAG ATCCAGCAAA TGAGCCACCA
GAAGAAATCT ATGAAATGAC CAAAGCCATA GCAGATACTC CTAAGGAACC AGAATATCTA
GAAATTGGCT TCCAAAGAGG TATTCCTACG ACCATCAACG GTACGTCTAA AAACCCTGTT
GAATTAATTG AACAACTCAA TCAAATCATA GGAAATCACG GTATTGGGCG GATTGACATC
ATTGAAAACC GCTTAGTAGG TATCAAATCA CGGGAAATCT ACGAATCACC TGCAATGGTA
GTTCTCATCA ACGCCCACCG CGATTTAGAA AGCCTGACCT TAACAGCAGA TGTTACTCAG
TATAAACGGG GCATTGAAGA AACTTACACC AAAATTGTAT ACAACGGACT TTGGTACAGC
CCTCTCAAAG CTGCCTTAGA TGCCTTTATT CAACAAACAC AAGAGCAAGT TTCTGGTGTT
GTGCGGTTAA AACTTTTCAA AGGTAATGCC ACCATAGTTG GTCGCTGGAG TGATAATTCC
CTTTACACTC CTGATTTAGC AACCTACGGA GCAGAAGATC AATTCAATCA CAAAGCTGCA
GAAGGGTTTA TCTACGTTTG GGGTCTACCT ACCCGCATTT GGGCGCAGAG CAACAAATAA
 
Protein sequence
MGRAKKVVLA YSGGVDTSVC IPYLKKEWGV EEVITLAADL GQGDELELVR EKALKSGASE 
SLVADVKKSF VTEYAFPAIQ ANALYENRYP LGTALARPLI AQILVETAQK YGADAIAHGC
TGKGNDQVRF DVSCTALNPN LKILAPAREW GMSREQTIAY GEQFGIPAPV KKSSPFSIDK
NLLGRSIEAG TLEDPANEPP EEIYEMTKAI ADTPKEPEYL EIGFQRGIPT TINGTSKNPV
ELIEQLNQII GNHGIGRIDI IENRLVGIKS REIYESPAMV VLINAHRDLE SLTLTADVTQ
YKRGIEETYT KIVYNGLWYS PLKAALDAFI QQTQEQVSGV VRLKLFKGNA TIVGRWSDNS
LYTPDLATYG AEDQFNHKAA EGFIYVWGLP TRIWAQSNK