Gene Sala_1223 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_1223 
Symbol 
ID4080447 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp1267078 
End bp1269084 
Gene Length2007 bp 
Protein Length668 aa 
Translation table11 
GC content65% 
IMG OID638009583 
Productcarbamoyl-phosphate synthase L chain, ATP-binding 
Protein accessionYP_616271 
Protein GI103486710 
COG category[I] Lipid transport and metabolism 
COG ID[COG4770] Acetyl/propionyl-CoA carboxylase, alpha subunit 
TIGRFAM ID[TIGR00514] acetyl-CoA carboxylase, biotin carboxylase subunit 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.116496 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTTCAAGA AAATCCTGAT CGCCAACCGC GGCGAAATCG CCTGCCGCGT CATCAAGACC 
GCGCGCCGCA TGGGCATCGC GACGGTCGCC GTCTATTCGG ACGCCGATGC GCGCGCGCCC
TTTGTCCAGA TGGCTGACGA GGCGGTGCAT ATCGGGCCGT CGCCCGCGTC CGAATCCTAT
CTGGTCGCCG ACAAGATCAT CGCCGCGTGC AAGGCGACGG GCGCCGAGGC GGTGCATCCG
GGCTATGGCT TCCTGTCCGA GCGCACCAGC TTCGCCGAGG CGCTCGCGAA GGAAAATATC
GCCTTCATCG GCCCGCCGGT GAACGCGATC GCGGCGATGG GCGACAAGAT CGAGTCGAAG
AAGCTCGCGA AAGAGGCGGG GGTCAATGTC GTCCCCGGCT TCGTCGGCGA GATCCGCGAC
ACCGAACATG CGGTCGAGAT TTCGAACGAG ATCGGCTATC CGGTGATGAT GAAGGCGTCG
GCGGGCGGCG GCGGCAAGGG GATGCGCCTC GCCTATGACG AGAAGGACGT CCGCGAGGGG
TTCGAGGCGA CCAAGCGCGA GGGGCTCAAC AGCTTTGGCG ACGACCGCGT GTTTATCGAG
AAGTTCATCC TCAACCCGCG CCATATCGAA ATCCAGATCC TCGGCGATCA GCACGGCAAC
ATTCTCTACC TCAACGAGCG CGAATGCAGC ATCCAGCGGC GGCATCAGAA GGTCGTCGAG
GAAGCGCCGT CGCCCTTCGT CACGCCGAAG ATGCGTCAGG CGATGGGCGA GCAGTGCGTC
GCGCTGGCGC GCGCGGTCGG CTATTACAGC GCAGGCACGG TCGAGCTGAT CGTGTCGGGC
GCCGACCCGA CGGGCGAGAG CTTCTACTTC CTCGAAATGA ACACGCGGCT GCAGGTCGAG
CATCCGGTGA CCGAGGCGAT CACCGGCATC GACCTGGTCG AGCAGATGAT CCGCGTCGCC
GCGGGCGAAA AGCTGGAGAT GACGCAGGAC GACATCAAGA TCGACGGCTG GGCGATCGAG
AATCGCGTCT ATGCCGAAGA TCCCTATCGC GGCTTCCTGC CCTCGACCGG GCGGCTCGTG
CGCTACCGCA CCCCCGTTCC CGCATGGACG GGCGACGAGC GCGGCGTCGA TGGTGTGCGC
GTCGATGCGG GGGTTGAGGA GGGCGGCGAG GTGTCGATCT TCTACGACCC GATGATCGCG
AAGCTGGTGA CATGGGGCAA GACGCGCGAC GAGGCGGCGG ACCTTCAGGT CGCGGCGCTC
GACCGTTTCG AGATCGAGGG GCTGGGCCAC AATATCGATT TCGTGTCGGC GATCATGCAG
CACCCGCGCT TCCGCTCGGG CGAACTGACG ACGGGCTTTA TCGCCGAGGA ATATCCCGAG
GGGTTCCACG GTGCCGACAC CAGCGAGGAT GTGACCCAGG CACTCGCCGC CATCGCGGGC
TTTATGGCGA GCGCCGAAGC CGACCGCGCA CGGCGCACCG ACGGACAGCT CGGCGACCGG
CTCGACCCGC CCGCCAAGTG GCAGGTGACC ATCGGCGGCG CGAGCCACAA GGTGAAGATC
GGCCGCAAGC ACATCAAGGT CGATGGTGAG AAGGTCGATA TCGCGCTCGA ATATACGCCG
GGCGACCGGT TGGTGGTCGC CGAGATCGAC GATAGCGAGC TTGCGGTGAA GGTCGCCAAA
ACGCGCACCG GCTGGCGCAT GACGACGCGC GGGCGCATCC ATGACGTGCG CGTGCTGCCT
TGGCATGTCG CACCACTCGC AAGCCATATG ATCGAGAAGA TCCCGCCCGA CCTGTCGAAG
TTTCTGATCT GCCCGATGCC CGGCTTGCTT GTCGCGCTGC ATGTGGGCGA GGGCGACAAG
GTCGAGGCAG GTCAGCCGCT CGCGACGGTC GAGGCGATGA AGATGGAAAA TATCCTGCGC
GCCGAAAAAG CGGGCGTTGT GAAGTCGGTC AATGCAGCGC AGGGCGACAG CCTGGCGGTC
GATGCCGTGA TTTTGGAGAT GGAGTGA
 
Protein sequence
MFKKILIANR GEIACRVIKT ARRMGIATVA VYSDADARAP FVQMADEAVH IGPSPASESY 
LVADKIIAAC KATGAEAVHP GYGFLSERTS FAEALAKENI AFIGPPVNAI AAMGDKIESK
KLAKEAGVNV VPGFVGEIRD TEHAVEISNE IGYPVMMKAS AGGGGKGMRL AYDEKDVREG
FEATKREGLN SFGDDRVFIE KFILNPRHIE IQILGDQHGN ILYLNERECS IQRRHQKVVE
EAPSPFVTPK MRQAMGEQCV ALARAVGYYS AGTVELIVSG ADPTGESFYF LEMNTRLQVE
HPVTEAITGI DLVEQMIRVA AGEKLEMTQD DIKIDGWAIE NRVYAEDPYR GFLPSTGRLV
RYRTPVPAWT GDERGVDGVR VDAGVEEGGE VSIFYDPMIA KLVTWGKTRD EAADLQVAAL
DRFEIEGLGH NIDFVSAIMQ HPRFRSGELT TGFIAEEYPE GFHGADTSED VTQALAAIAG
FMASAEADRA RRTDGQLGDR LDPPAKWQVT IGGASHKVKI GRKHIKVDGE KVDIALEYTP
GDRLVVAEID DSELAVKVAK TRTGWRMTTR GRIHDVRVLP WHVAPLASHM IEKIPPDLSK
FLICPMPGLL VALHVGEGDK VEAGQPLATV EAMKMENILR AEKAGVVKSV NAAQGDSLAV
DAVILEME