Gene Clim_1250 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_1250 
Symbol 
ID6355351 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp1349175 
End bp1350377 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content54% 
IMG OID642668866 
Productargininosuccinate synthase 
Protein accessionYP_001943296 
Protein GI189346767 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0137] Argininosuccinate synthase 
TIGRFAM ID[TIGR00032] argininosuccinate synthase 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTAAGG AAAAAATCGC ACTTGCCTAT TCCGGAGGCC TCGATACCTC CGTGATGATC 
AAATGGCTCA AAGACAAGTA TGACGCCGAA ATTGTTGCCG TTACCGGTAA CCTCGGCCAG
CAGAAAGAGA TCGAAAATCT CGAATCAAAA GCATATTCGA CGGGAGCCTC GGCTTTCAGG
TTTGTCGATC TCCGCAAAAC CTTTGTTGAA GAGTATATCT GGCGGGCACT GAAAGCCGGC
GCCCTTTACG AGGATGTCTA TCCGCTGGCA ACGGCGCTCG GGCGTCCGCT GCTTGCCAAA
GCGCTTGTCG ATGTGGCACT TGAGGAGAAC TGCACCATGC TGGCCCACGG CTGTACCGGA
AAAGGAAACG ACCAGGTTCG TTTCGAAGTG ACCTTTGCTT CGCTTGCTCC CCATCTGAAA
ATTCTCGCTC CCCTGCGCGA ATGGGAGTTC ACTTCTCGCG AGGCAGAGAT CGCCTACGCT
CTCGAACATA ACATACCGGT ATCGGCCACA AAGAAAAGCC CCTACTCGAT CGACGAGAAC
ATCTGGGGCA TCAGTATCGA ATGCGGCGTG CTCGAAGATC CCATGGTGAC TCCTCCCGAA
GATGCCTACC AGATCACCAC CTCTCCGGAA AATGCGCCCG ATACTCCGGC ATCGGTGGAG
ATCGAATTTG TGAAAGGCAT ACCGGTAGCT CTCGACGGCG AGCGTATGAG CGGACTCGAC
ATGATCCAGA AACTCAACGA CATCGGCGCG GCAAATGGCA TCGGACGTCT CGACATGATC
GAGAACCGCG TTGTCGGCAT CAAGTCGCGT GAAATCTACG AGGCACCGGC AGCAACCATC
CTGCACTTCG CACACCGTGA GCTGGAGCGG CTGACGCTTG AAAAAACCGT ATTCCAGTAC
AAGAAGAACA TCAGCCAGGA CTACGCCAAC ATCATCTATA ACGGCACCTG GTTCTCCCCG
ATGCGCAAGG CACTTGATGC CTTCGTCGAC GAAACCCAGA AACCGGTAAC CGGTCTTGTG
CGCCTGAAGC TTTACAAAGG CGGTATCTCG CTGCTCGGCA GAAACTCGCC GAACTCGCTC
TACAACGAAG AACTTGCGAC CTACACCGAA GCCGATACCT TCAACCACAA GGCAGCGGCA
GGGTTCATTC ACCTGTACGG GCTTGGCATG AAAACCTTCA GCCAGGTCAA TCCCGGTCTG
TAA
 
Protein sequence
MSKEKIALAY SGGLDTSVMI KWLKDKYDAE IVAVTGNLGQ QKEIENLESK AYSTGASAFR 
FVDLRKTFVE EYIWRALKAG ALYEDVYPLA TALGRPLLAK ALVDVALEEN CTMLAHGCTG
KGNDQVRFEV TFASLAPHLK ILAPLREWEF TSREAEIAYA LEHNIPVSAT KKSPYSIDEN
IWGISIECGV LEDPMVTPPE DAYQITTSPE NAPDTPASVE IEFVKGIPVA LDGERMSGLD
MIQKLNDIGA ANGIGRLDMI ENRVVGIKSR EIYEAPAATI LHFAHRELER LTLEKTVFQY
KKNISQDYAN IIYNGTWFSP MRKALDAFVD ETQKPVTGLV RLKLYKGGIS LLGRNSPNSL
YNEELATYTE ADTFNHKAAA GFIHLYGLGM KTFSQVNPGL