Gene Hhal_1775 details

Gene Information

Locus tag: Hhal_1775
Symbol:
ID: 4711004
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhodospira halophila SL1
Kingdom: Bacteria
Replicon accession: NC_008789
Strand:
Start bp: 1947429
End bp: 1950650
Gene Length: 3222 bp
Protein Length: 1073 aa
Translation table: 11
GC content: 67%
IMG OID: 639856245
Product: carbamoyl-phosphate synthase, large subunit
Protein accession: YP_001003341
Protein GI: 121998554
COG category: [E] Amino acid transport and metabolism
              [F] Nucleotide transport and metabolism
COG ID: [COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ)
TIGRFAM ID: [TIGR01369] carbamoyl-phosphate synthase, large subunit
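
The coordinates and lengths in the table above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The variable names are illustrative and do not come from the database.

```python
# Values copied from the Gene Information table.
start_bp = 1947429
end_bp = 1950650
protein_length_aa = 1073

# Assumption: coordinates are 1-based and inclusive.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
expected_protein_length = gene_length_bp // 3 - 1

print(gene_length_bp)           # 3222
print(gene_length_bp % 3)       # 0, so the CDS is a whole number of codons
print(expected_protein_length)  # 1073
```

Under these assumptions, both derived values agree with the Gene Length and Protein Length fields.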


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0103005
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCGAAAA GAACTGACAT CCAATCGATC CTGATCATCG GCGCCGGCCC CATCGTCATC 
GGCCAGGCGT GCGAGTTCGA CTACTCCGGT GCGCAGGCCT GCAAGGCCCT GCGCGAGGAG
GGCTATCGGG TCATCCTGGT GAACTCCAAC CCGGCGACCA TCATGACCGA CCCGGAGACC
GCCGACGCGG TCTATATCGA ACCGGTGGAG TGGCAGACGG TCTCGCGGAT CATCGAGCGG
GAGCAGCCGG ATGCGGTGCT GCCCACCATG GGCGGGCAGA CGGCGCTGAA TTGTGCCCTC
GATCTGGTCA AGCACGGCGT GCTGGAGCGA TACGGCGTCG AGATGATTGG CGCCAGCCGG
GAGGCGATCG ACAAGGCTGA GGAGCGCGAG GCGTTTCGCG CGGCGATGGC CCGCATCGGT
CTGGAGACGC CGCGGGCCGA GCTGGCCCGT TCCATGGCCG AGGCCCAGGC GGCGCAGGCA
CGCATGGGCT TCCCGGTGAT CATCCGCCCC TCCTACACCC TGGGCGGCTC CGGCGGCGGC
ATCGCCTACA ACCGCGAGGA GTTCAACGAG ATCGTCGAGC GCGGGCTCGA CCTCTCCTAC
ACCAACGAGG TGCTCCTCGA GGAGTCGGTG CTCGGCTGGA AGGAGTATGA GATGGAGGTC
GTGCGGGACC GGCACGACAA CGCCATCATC GTCTGCTCCA TCGAGAACCT CGACCCCATG
GGGGTGCACA CCGGCGACTC CACCACCATT GCACCGGCGC AGACCCTGAC CGATAAGGAG
TACCAGCTGA TGCGCGACGC ATCGCTGGCG GTGCTGCGCG AGATCGGTGT GGAGACCGGC
GGATCCAACG TCCAGTTCGC CATCAACCCG GACAACGGGC GCATGGTGAT CATCGAGATG
AATCCGCGGG TGTCGCGCTC CTCGGCGCTG GCCTCCAAGG CGACCGGCTT CCCCATCGCC
AAGGTGGCGG CCAAGCTCGC CGTCGGTTAC ACGCTGGATG AGCTGCGCAA CGAGATCACC
GGCGGGGCGA CGCCGGCCTC CTTCGAGCCG ACCATCGACT ACGTGGTCAC CAAGATCCCG
CGTTTCACCT TCGAGAAGTT CCCCCAGGCC GAGTGCTACC TGACGACACA GATGAAGTCG
GTGGGCGAGG TCATGGCCAT CGGGCGGACC TTCCAGGAGT CCTTCCAGAA GGCCCTGCGG
GGTCTGGAGC AGGACCTTTC CGGGCTCGAC GAGCGGCTCG ATCGCAGCCG TCAGGATGTG
CGCGATACGG TGCGCCACTC GTTGCGTCAG CCGACGCCCG AACGGGTCCT GCATCTGGGT
GACGCCTTCC GGGTCGGCTT TACCCTGGAC GAGGTCCACG GGATGACGGC CATCGATCCG
TGGTTCCTGG CGCAGATCGA GGAACTGATC GCCGTGGAGG GGCAGGTCGC CGCGAGCGCG
CTGGACGATT GTGATGCCGG TGCCTTACTG CGGCTCAAGC GCCGCGGTTT CTCCGATGCC
CGGTTGGCCA GCCTGTGGGG CGTTACCGAG GCGCAGGTGC GGCAGCGTCG CCGTGAGCTC
GGCGTGCGGC CGGTGTTCAA GCGGGTGGAC TCCTGCGCCG CCGAGTTCCC CACCGCCACG
GCGTACCTCT ACTCGACCTA CGAGGAGGAG TGCGAGGCCG AGCCCACCGG GCGCAAGAAG
ATCATGGTCC TTGGTGGCGG CCCGAATCGC ATCGGCCAGG GGATCGAGTT CGACTACTGC
TGTGTCCACG CATCGCTGTC GCTGCGCGAG GACGGCTACG AGACCATCAT GGTCAACTGC
AACCCCGAGA CGGTCTCGAC GGACTACGAC ACTTCGGATC GGCTGTACTT CGAGCCGCTG
ACCCTGGAGG ACGTGCTCGA GGTGGTGGAG ACCGAGCAGC CGGATGGGGT GGTCGTCCAG
TACGGTGGGC AGACGCCGCT GAAGCTTGCC CGCGAGCTGG AGGCTGCCGG GACACCGATC
ATCGGCACCA GCCCGGACTC CATCGACCTG GCCGAAGACC GGGAGCGGTT CCAGGAGCTC
ATCGGACGGA TCGACCTGAT GCAGCCGCCG AACCGCACCG CCCGGACCGA GACCGAGGCG
CTTCAGCTGG CTGCCGAGAT CGGTTACCCG CTGGTGGTGC GCCCTTCGTA CGTGCTCGGC
GGGCGGGCGA TGGAGATCGT CTACGAGGAG AGCGAGCTGC GCCAGTACAT GAATGAGGCG
GTGCGGGTCT CGCACAACTC GCCCGTCCTG CTTGACCGCT TCCTCGACGA CGCCGTGGAG
GTGGATGTGG ACGCCGTCAG CGACGGCGAC CAGGTGGTCA TCGGCGGGAT CATGCAGCAC
ATCGAGCAGG CCGGCGTCCA CTCCGGGGAC TCCGCCTGCT CCATCCCGCC CTACACCCTG
GGGCAGGATG TGCAGGACCG GATTCGCGAG CAGGTGCGGC TGCTGGCCCG GGAGCTCGGT
GTGGTCGGAC TGATGAACGT GCAGTTCGCC ATCCAGGGGC AGCGCATCTT CCTCCTCGAG
GTCAATCCGC GTGCCTCGCG GACGGTGCCG TACGTCTCCA AGGCCTGCGG TGTGCCCCTG
GCCAAGGTGG CTGCGCGGTG CATGGCCGGC CGGACGCTGG CCGAGCAGGG GGTGGTGAGC
GAAGTCATTC CCAACTACTA TTCGGTCAAA GAGGCGGTCT TCCCGTTCCT CAAATTCCCC
GGTGTCGATC CCATCCTGGG TCCGGAGATG AAATCTACCG GAGAGGTGAT GGGTATTGGC
GCCTGTTTCG GAGAGGCCTA CGCCAAGGCG CAGCTGGCTG CGGGGGTGAC CCTGCCGCGG
GGCGGCTGTG CCTTTGTCAG CGTGCGTGAA GTGGACAAGG AGGCAGCGGT GGAGGTGGCG
CGGGACTTGG TCCGACGCGG TTTCCGCTTG ATCGCCACCC ATGGCACAGC GGCCGCCCTC
GAAGAGGCGG GCCTGGAGGT GCGCCGGATC AACAAGGTCA TTGAGGGACG GCCGCATGTC
GTGGACGCCA TCAAGAACGA CGAGATCGAC CTGATCGTGA ACACCACCGA GGGGCGGCAG
GCCATCGCCG ACTCCTACTC GATCCGCCGC GAGGCGCTGC AGCGCAAGGT CTGTTACACG
ACGACCATCG CGGGCGCTCG GGCGACGTGC CTGGCGCTGG ATCACATGAA GGACTGGGAG
GCCCGCCCCC TCGATGCCCT GCACAGGGAG ATGACGGGAT GA
 
Protein sequence
MPKRTDIQSI LIIGAGPIVI GQACEFDYSG AQACKALREE GYRVILVNSN PATIMTDPET 
ADAVYIEPVE WQTVSRIIER EQPDAVLPTM GGQTALNCAL DLVKHGVLER YGVEMIGASR
EAIDKAEERE AFRAAMARIG LETPRAELAR SMAEAQAAQA RMGFPVIIRP SYTLGGSGGG
IAYNREEFNE IVERGLDLSY TNEVLLEESV LGWKEYEMEV VRDRHDNAII VCSIENLDPM
GVHTGDSTTI APAQTLTDKE YQLMRDASLA VLREIGVETG GSNVQFAINP DNGRMVIIEM
NPRVSRSSAL ASKATGFPIA KVAAKLAVGY TLDELRNEIT GGATPASFEP TIDYVVTKIP
RFTFEKFPQA ECYLTTQMKS VGEVMAIGRT FQESFQKALR GLEQDLSGLD ERLDRSRQDV
RDTVRHSLRQ PTPERVLHLG DAFRVGFTLD EVHGMTAIDP WFLAQIEELI AVEGQVAASA
LDDCDAGALL RLKRRGFSDA RLASLWGVTE AQVRQRRREL GVRPVFKRVD SCAAEFPTAT
AYLYSTYEEE CEAEPTGRKK IMVLGGGPNR IGQGIEFDYC CVHASLSLRE DGYETIMVNC
NPETVSTDYD TSDRLYFEPL TLEDVLEVVE TEQPDGVVVQ YGGQTPLKLA RELEAAGTPI
IGTSPDSIDL AEDRERFQEL IGRIDLMQPP NRTARTETEA LQLAAEIGYP LVVRPSYVLG
GRAMEIVYEE SELRQYMNEA VRVSHNSPVL LDRFLDDAVE VDVDAVSDGD QVVIGGIMQH
IEQAGVHSGD SACSIPPYTL GQDVQDRIRE QVRLLARELG VVGLMNVQFA IQGQRIFLLE
VNPRASRTVP YVSKACGVPL AKVAARCMAG RTLAEQGVVS EVIPNYYSVK EAVFPFLKFP
GVDPILGPEM KSTGEVMGIG ACFGEAYAKA QLAAGVTLPR GGCAFVSVRE VDKEAAVEVA
RDLVRRGFRL IATHGTAAAL EEAGLEVRRI NKVIEGRPHV VDAIKNDEID LIVNTTEGRQ
AIADSYSIRR EALQRKVCYT TTIAGARATC LALDHMKDWE ARPLDALHRE MTG
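
Both sequences above are printed in blocks of 10 characters, so the whitespace has to be removed before any analysis. The following is a minimal sketch of how the record could be checked: it joins the blocks, computes the GC fraction, and translates the DNA. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is the standard one, whose codon-to-amino-acid assignments are shared by translation table 11. The sample is only the first line of the gene sequence, not the full gene.

```python
from itertools import product

# Standard codon assignments (shared by translation table 11),
# listed in TCAG order for each of the three codon positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()

def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)

def translate(dna: str) -> str:
    """Translate complete codons, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First line of the gene sequence from this record.
sample = clean(
    "ATGCCGAAAA GAACTGACAT CCAATCGATC CTGATCATCG GCGCCGGCCC CATCGTCATC"
)
print(len(sample))        # 60
print(translate(sample))  # MPKRTDIQSILIIGAGPIVI
print(round(gc_fraction(sample), 2))
```

Applied to the full gene sequence, the same functions could be used to compare the result with the reported GC content and the listed protein sequence.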