Gene Ava_2068 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_2068 
Symbol 
ID3680541 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp2559984 
End bp2561186 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content47% 
IMG OID637717413 
Productargininosuccinate synthase 
Protein accessionYP_322585 
Protein GI75908289 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0137] Argininosuccinate synthase 
TIGRFAM ID[TIGR00032] argininosuccinate synthase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.126531 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0446065 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTCGCG CCAAAAAGGT TGTTCTGGCA TATTCTGGTG GAGTAGATAC CTCCGTTTGC 
ATTCCCTACC TAAAACAAGA GTGGGGAGTA GAAGAGGTAA TTACCCTAGC AGCAGATTTA
GGCCAGGGAG ATGAATTAGA ACCAATCCGA GAAAAAGCAC TAAAATCGGG TGCAAGTGAA
TCCCTCGTAG CGGATGTCAA AGAAAGTTTT ATTAAAGATT ATGCTTTTCC CGCGATACAG
GCCAACGCCC TCTATGAAAA TCGCTATCCG CTAGGAACTG CCTTAGCCCG TCCGTTGATT
GCGAAGATAT TGGTCGAAGC AGCAGAAAAA TACGGTGCTG ATGCGATCGC ACATGGTTGT
ACAGGTAAAG GTAATGATCA GGTGCGGTTT GATGTCTCTT GTACTGCTCT CAATCCTAAA
TTGAAGATCC TCGCCCCCGC ACGGGAATGG GGTATGAGTC GGGAAGCAAC CATCGCCTAC
GGTGAAAAGT TTGGCATTCC CTCACCTGTG AAAAAGTCTT CACCTTACAG CATTGATAAG
AATTTACTTG GTCGCAGTAT TGAAGCTGGT GCGCTGGAAG ATCCCAAATT TGAGCCACCA
GAAGAAATTT ATGAGATGAC AAAGGCGATC GCCGACACCC CAAATGAGCC AGAATACATT
GAAATAGGAT TTACTCAAGG TCTTCCCACG ACTATCAACG GTACACCCAA AGACCCTGTA
GCGCTCATCC AAGAACTAAA TCAACTAGTT GGTAGTCACG GTGTCGGACG CATCGATATG
ATTGAAAACC GATTAGTGGG AATCAAATCA CGAGAGATTT ACGAATCTCC GGCAATGTTG
GTGCTAATTC AAGCCCACAG AGACCTAGAA AGCCTGACAT TAACCGCCGA TGTTAGCCAC
TACAAACGAG GTATTGAAGA GACCTACAGC CAAATAGTTT ACAACGGCTT GTGGTACAGT
CCCCTGAAAG CCGCTTTAGA TGCCTTTATT CAAAAGACTC AAGAACGGGT ATCAGGAATT
GTCAGAGTGA AACTATTCAA AGGTAACGCC ACCATAGTCG GACGCTGGAG CGATAGTTCA
CTCTACACCC CCGACTTAGC AACTTACGGC GCTGAAGACC AATTTGACCA CAAAGCCGCC
GAAGGCTTCA TTTACGTTTG GGGATTACCA ACTCGTATCT GGGCGCAGCA GGATAGAGGT
TAA
 
Protein sequence
MGRAKKVVLA YSGGVDTSVC IPYLKQEWGV EEVITLAADL GQGDELEPIR EKALKSGASE 
SLVADVKESF IKDYAFPAIQ ANALYENRYP LGTALARPLI AKILVEAAEK YGADAIAHGC
TGKGNDQVRF DVSCTALNPK LKILAPAREW GMSREATIAY GEKFGIPSPV KKSSPYSIDK
NLLGRSIEAG ALEDPKFEPP EEIYEMTKAI ADTPNEPEYI EIGFTQGLPT TINGTPKDPV
ALIQELNQLV GSHGVGRIDM IENRLVGIKS REIYESPAML VLIQAHRDLE SLTLTADVSH
YKRGIEETYS QIVYNGLWYS PLKAALDAFI QKTQERVSGI VRVKLFKGNA TIVGRWSDSS
LYTPDLATYG AEDQFDHKAA EGFIYVWGLP TRIWAQQDRG