Gene Acid345_4158 details


Gene Information

Locus tag: Acid345_4158
Symbol:
ID: 4072117
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 4924422
End bp: 4925639
Gene Length: 1218 bp
Protein Length: 405 aa
Translation table: 11
GC content: 61%
IMG OID: 637986189
Product: argininosuccinate synthase
Protein accession: YP_593232
Protein GI: 94971184
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0137] Argininosuccinate synthase
TIGRFAM ID: [TIGR00032] argininosuccinate synthase
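
The coordinate and length fields above are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS of the given length, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length = gene_length(4924422, 4925639)
print(length)                  # 1218
print(protein_length(length))  # 405
```

Both results agree with the Gene Length and Protein Length fields reported for this record.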


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.0852735
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.163205
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTGAAA AAATCGTTCT TGCTTATTCC GGAGGCCTTG ATACCTCCAT CATCATTCCT 
TGGCTGAAGG AAAACTATAG CTGCGACGTC ATTGCCATGG TTGGTGACGT CGGACAGGGC
GACGACATCG ATGCCGTCGT AGCGAAGGCT CATAAAACGG GCGCTTCCAA AGTCATCGTA
AAGGACCTCC GCGAGGAGTT CCTCACCCAG TACGTCTATC CCGCGATCTC CACCGGCGCG
GTCTACGAGC ACAAGTATCT GCTCGGAACT TCGCTCGCAC GTCCGCCGAT CGCGAAGGCG
CAAGTCGAAG TCGCACTCGC CGAAGGCGCC ACTGCGGTCT CGCACGGCTG CACTGGCAAG
GGCAACGACC AGGTCCGCTT CGAACATGCC TTCCAGGCGC TCGCGCCGGA ACTCAAGATC
ATCGCTCCCT GGCGCGAATG GACGCTGAAG TCCCGCGAAG ATTGTCTCGA CTACGCCGAA
GCCCATGGCA TCTCCGTTGC GCAGAGCCGC GAGAAGATCC ACTCCCGCGA TCGCAACTTG
CTGCACGTTA GCCACGAGGG CGGGGAACTC GAAGATCCCA ACAACGCTCC ACTCGATACC
ACTTGGACGT GGACCAAGTC CCCGCAGGAA GCGCCCGATC GCGTCGAAGA AGTCACCATC
GGATTCGAAG GCGGCGTGCC GGTTTCCATC AACGGCATGA AGCTCGAACC GCTCGCGCTC
ATCGAGTTGC TCAATGAAAT TGGCGCGCGC AACGCCATCG GCCGCATCGA TCTCGTCGAG
AACCGCTTCG TCGGCATCAA GTCGCGTGGC TGCTATGAGA CCCCCGGCGG ATCTCTGCTG
CTCGCCGCGC ATCGCGAACT CGAGGCCCTC TGTCTCGATC GCGACACCCT GCACTACAAG
CAGGAAGTCG CGCTCAAGTG GGCGGAACTC GTGTACTTCG GCCTCTGGTT CACGCCGCTG
CGCGAATCGC TCGACGCTTT CGTCGCGAGC ACGCAGAAGA ACATTGCCGG CGCCGTGAAG
CTCGCCCTCT ACAAGGGCAA CATCGCCGTA GCCGGGCGCA CCTCGCCCAA ATCGCTCTAT
CGTCCCGACA TCGCCAGCTT CACCATGGGT GCGGGCTACG ACCAGAAGGA CGCCGAAGGC
TTCATCCGCA TTCTCGGACT GCCCGCGCGT TCGCGTGCGC TCATCGAGAA CGCCGGCAAA
GAAAAGGTGT CGAAATGA
 
Protein sequence
MREKIVLAYS GGLDTSIIIP WLKENYSCDV IAMVGDVGQG DDIDAVVAKA HKTGASKVIV 
KDLREEFLTQ YVYPAISTGA VYEHKYLLGT SLARPPIAKA QVEVALAEGA TAVSHGCTGK
GNDQVRFEHA FQALAPELKI IAPWREWTLK SREDCLDYAE AHGISVAQSR EKIHSRDRNL
LHVSHEGGEL EDPNNAPLDT TWTWTKSPQE APDRVEEVTI GFEGGVPVSI NGMKLEPLAL
IELLNEIGAR NAIGRIDLVE NRFVGIKSRG CYETPGGSLL LAAHRELEAL CLDRDTLHYK
QEVALKWAEL VYFGLWFTPL RESLDAFVAS TQKNIAGAVK LALYKGNIAV AGRTSPKSLY
RPDIASFTMG AGYDQKDAEG FIRILGLPAR SRALIENAGK EKVSK
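
The sequences above are printed in blocks of ten characters separated by spaces, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup, with a GC-fraction helper and a rough coding-sequence sanity check. All function names are hypothetical. The check assumes an ATG start codon, which is a simplification because translation table 11 also permits alternative start codons.

```python
# Stop codons under translation table 11 (bacterial, archaeal, plant plastid).
STOP_CODONS = ("TAA", "TAG", "TGA")


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for block formatting."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


def looks_like_cds(seq: str) -> bool:
    """Rough check: whole codons, ATG start (assumed), and a stop codon at the end."""
    return len(seq) % 3 == 0 and seq.startswith("ATG") and seq[-3:] in STOP_CODONS


# First two blocks of the gene sequence above, as printed.
fragment = clean_sequence("ATGCGTGAAA AAATCGTTCT")
print(fragment)  # ATGCGTGAAAAAATCGTTCT
```

Pasting the full gene sequence into `clean_sequence` and passing the result to `gc_fraction` gives a value that can be compared with the GC content field of the record.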