Gene Cphamn1_1332 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphamn1_1332 
Symbol 
ID6375010 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides BS1 
KingdomBacteria 
Replicon accessionNC_010831 
Strand
Start bp1446307 
End bp1447515 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content52% 
IMG OID642683829 
Productargininosuccinate synthase 
Protein accessionYP_001959743 
Protein GI189500273 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0137] Argininosuccinate synthase 
TIGRFAM ID[TIGR00032] argininosuccinate synthase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00292839 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAGGG AAAAAATCGC TTTAGCCTAT TCCGGCGGGC TCGATACATC TGTCATGATC 
AAGTGGCTGA AAGATAAATA CGATGCTGAT ATCATTGCCG TGACCGGCAA TCTCGGCCAA
GAGAAAGAAA TAGAAAACCT TGAACAAAAG GCTTTTGATA CGGGAGCGTC AGGGTTCTCT
TTCCTGGATT TGCGCAAAGA ATTCGTTGAA AACTATATCT GGCCCGCGCT CAAGGCCGGG
GCACTGTACG AAGAGGTGTA TCCTCTCGCG ACAGCCCTTG GCCGCCCGCT GCTAGCCAAA
GCCCTTGTGG ATGTCGCTCT CTCCGAAGAC TGCACCATGA TCGCCCACGG ATGTACCGGC
AAGGGTAACG ACCAGGTCCG TTTTGAAGTA ACCTTCGCTT CGCTTGCACC TCACCTCAAG
GTTCTTGCCC CGCTTCGCGA ATGGGAGTTC AATTCGAGGG AAGCAGAGAT GGCCTATGCT
GAAAAACACG GTATCCCGGT TTCCGCCACA AAAAAGAGTC CCTACTCCAT CGATGAAAAC
ATCTGGGGAA TCAGCATCGA GTGCGGTGTT CTTGAAGATC CGATGGTAGC TCCGCCGGAA
GACGCGTATC AGATCACGAC ATCTCCGGAA AAAGCCCCTG ACAACGCCGC GGTAATCGAT
ATTGAATTTG AACAGGGTGT GCCTGTCGCG CTTGACGGCA GGAAAATGGA AGGCCTGGAT
CTTATTGTGG AACTCAACAA GCACGGTGCC GCCCATGGAG TCGGCCGTCT GGATATGATC
GAGAACCGGG TTGTCGGCAT CAAGTCAAGA GAGATCTATG AAGCGCCTGC GGCTACGATC
CTCCATTTTG CCCATCGTGA ACTTGAAAGG CTGACACTGG AAAAAACCGT TTTTCAATAC
AAAAACACGA TCAGCCAGGA CTACGCCAAC CTTATCTACA ACGGCACCTG GTTCTCCCCG
ATGCGCGAAG CACTTGACGG ATTCGTTGAC GCAACCCAGA AGCATGTCAC CGGCCTTGTC
AGGGTCAAGC TCTTCAAAGG CTCAGTCACC CTGCTCGGAA GAACATCTCC CTGGTCGCTC
TACAATGAAG AACTGGCCAC GTACACCGAA GCCGACACCT TCAATCACAA GGCCGCCGAA
GGGTTCATTC ACCTCTACGG CCTCGGTCTG AAAACCTACA GCGAGGTTCA GGCGAATAAC
AGAAAGTAA
 
Protein sequence
MKREKIALAY SGGLDTSVMI KWLKDKYDAD IIAVTGNLGQ EKEIENLEQK AFDTGASGFS 
FLDLRKEFVE NYIWPALKAG ALYEEVYPLA TALGRPLLAK ALVDVALSED CTMIAHGCTG
KGNDQVRFEV TFASLAPHLK VLAPLREWEF NSREAEMAYA EKHGIPVSAT KKSPYSIDEN
IWGISIECGV LEDPMVAPPE DAYQITTSPE KAPDNAAVID IEFEQGVPVA LDGRKMEGLD
LIVELNKHGA AHGVGRLDMI ENRVVGIKSR EIYEAPAATI LHFAHRELER LTLEKTVFQY
KNTISQDYAN LIYNGTWFSP MREALDGFVD ATQKHVTGLV RVKLFKGSVT LLGRTSPWSL
YNEELATYTE ADTFNHKAAE GFIHLYGLGL KTYSEVQANN RK