Gene Dret_0320 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_0320 
Symbol 
ID8418124 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp395568 
End bp398804 
Gene Length3237 bp 
Protein Length1078 aa 
Translation table11 
GC content59% 
IMG OID645036885 
Productcarbamoyl-phosphate synthase, large subunit 
Protein accessionYP_003197200 
Protein GI258404458 
COG category[F] Nucleotide transport and metabolism
[E] Amino acid transport and metabolism 
COG ID[COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) 
TIGRFAM ID[TIGR01369] carbamoyl-phosphate synthase, large subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value3.36143e-07 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCAAAA GGACTGATAT CCGGAAAATT ATGCTCATCG GCTCCGGTCC GATTATTATC 
GGACAGGCCT GCGAGTTTGA CTATTCAGGC ACTCAGGCCC TCAAGGCCCT GAAAGAAGAG
GGATACGAGG TTGTCCTGAT CAACTCCAAT CCGGCAACGA TTATGACCGA TCCGGGTTTG
GCCGACCGCA CCTATGTCGA ACCCCTGGAA CCAGAAAGTG TGGCCAAGAT TATTGCCAAA
GAGCGTCCGG ATGCCTTGCT GCCGACTCTC GGCGGCCAAA CGGGCTTGAA TACGGCGGTT
TCCGTGGCCG AATCCGGGGT GCTGGAAAAA TACGGGGTCG AACTCATCGG CGCCTCACTG
GAGGTGATCC AGAAGGCCGA AAGCCGGCGC CAATTCCGGA CCGCTATGGA AAATATCGGG
CTGACTGTTC CCCAAAGCGG TATCGCCTCC AGTCTCGAGG AGGTCCGGGA ATGGACCAAC
ACCTTGAGTT TTCCCATGAT TGTCCGACCA GCCTTTACCC TGGGCGGAAC TGGCGGCGGA
GTGGCCTACA ACCGGGAGGA TCTCGAAGAG ATCGCCCGTC AGGGATTGGC GGCCAGCCCC
AGTCACGAGG TCATGCTCGA AGAGTCCGTG CTCGGCTGGA AAGAATACGA ACTCGAGGTC
ATGCGCGATA AGCACGACAA TTGCGTCATC ATCTGCTCCA TCGAAAATCT CGATGCCATG
GGAGTGCACA CCGGCGATTC CGTGACGGTC GCACCGGCCC AGACATTGAC CGACCGGGAA
TTTCAAAAGA TGCGTGACGC CTCGCTGGCG ATCATGCGCG AAATTGGCGT CGAAACCGGC
GGTTCCAACG TCCAATTCGC CGTCAATCCC GAAAACGGCG ACATGGTGGT CATCGAGATG
AACCCTCGGG TCTCCCGTTC TTCGGCCCTG GCTTCCAAAG CCACAGGATT TCCCATCGCC
AAGATCGCGG CCAAATTGGC TGTGGGCTAT ACGTTGGACG AGATCCCCAA TGACATTACC
CGCGAAACCA TGGCCTCCTT CGAGCCGACC ATCGACTATT GCGTGGTCAA GATTCCCCGA
TTTACCTTTG AAAAATTTCC CGGTTCCCAA GATTTTTTGA CCACGACCAT GAAAAGTGTC
GGGGAAACCA TGGCCATCGG CCGGACCTTC AAGGAATCCC TGCAAAAAGG GCTGCGTTCC
CTGGAGGTCG GCATGGCCGG CCTGGGCAAT GGCACCGGCG AAGACGATCT GGATGCCGAG
AGCCTGATGG CGCTTCTCAA GACCCCGAAT TCGCAGCGCC TTTTCGCTAT TCGCGACGCC
TTGCGCCAGG GGATGCCACT GGAGACCCTG TACCAGGCGA CTTTTGTCGA TCCCTGGTTT
TTGCAGCAGA TCAAGGCGAT CGTGGATTGC GAGGCCGAGC TGAAACATTT CAGCCTGGAG
CAGGCCATCA GCGCAGACAA CCCCCATATG GAAGAAGTCC TGCGCCATGC CAAGGAGTTC
GGGTTTTCCG ATCTCCAACT GGCCAATCTC TGGAAGCGCA CCGAGGACGA CATCCGGCAG
TTGCGGATCC AAATGGGGAT CACCGCTACC TATAAGCTCG TTGATACTTG CGCGGCGGAA
TTCGAAGCCT ATACGCCGTA TTATTATTCC TCGTACGAGC GGGAAAATGA AAACCGGGCT
ACGGGCAAAC GCAAGGTAGT CATCCTGGGC GGTGGCCCGA ACCGTATCGG TCAGGGGATC
GAATTCGACT ATTGCTGCGT CCACGCGTCC TACGCCTTGC GCGACATGGA TATCGAGTCG
GTCATGGTCA ATTCCAATCC GGAGACGGTC AGCACCGACT ACGACACCTC GGACAGACTC
TATTTTGAGC CCCTGACCCG CGAAGATGTC CTGAACATTA TCGAGCAGGA GCAACCCGAA
GGGGTTATTG TCCAGTTTGG CGGCCAGACG CCGCTCAATC TGGCTGTGCC GCTGCTGCGC
GCTGGCGTGC CGATTCTGGG CACGACCCCG GACTCCATCG ACCGGGCCGA AGACCGTGAG
CGGTTCCACG CTCTGCTTAA CAAACTCGGT CTCAAGCAGC CGGATAACGG CCTGGCGACC
TCTGAGGAGG GTGCCCTGGA TGTGGCCGAG CGCATCGGCT ACCCGGTGGT CGTCCGCCCC
TCCTATGTTC TGGGCGGACG GGCCATGGAA ATCGTCTACG ATTCGGATGA ACTCAAAAAT
TATTTCCGCG ATGCGGTAGT CGCTTCTCCC GACCACCCGA TCCTGATCGA CAAATTCCTG
CAAGGCGCTG TGGAAATCGA TGTCGACGCC TTGTGTGACG GGACCCAGAC CTACGTCGGC
GGGGTCATGG AGCACATTGA AGAAGCCGGT GTCCATTCCG GGGATTCGGC CTGTGTCTTG
CCGCCGCACA CCGTCAGTCA GGAGACCATC GACGAAATCC GCCGGCAGTC CAGTGCCCTG
GCTGAGGAAC TCGGCGTGGT CGGGCTGATG AACGTCCAGT TCGCAGTTCA AAACGGGGAG
ATCTTTATCC TGGAGGTCAA CCCGCGGGCC TCGCGAACGG TTCCTTTTGT CAGCAAGGCC
ACCGGTGTGC CGCTGGCCAA ACTGGCGACC CAGGTCATGA TGGGCCGGAC GCTGGAGGAC
CTTAATCCCT GGTCGATGCG TCAATGGGGA CACCTGGCGG TCAAGGAGGC GGTCATGCCG
TTTAACCGCT TCCCGGGCGT GGATGTCCTG CTTGGCCCGG AGATGCGCTC CACCGGCGAG
GTCATGGGCG TGGACACGGA ATTCGGCCTG GCCTTTATGA AGGCCCAGCT CGGTGCCGGG
CAGAAACTCC CTGCCAGCGG CACGGTGTTC ATTTCCGTCA ACGATGCCGA CAAGGATGCC
GTGGTCGAAG CGGCTCGGAC TTTTGCCCGG ATCGGATTGC GCATTGTTTC CACCGAGGGC
ACGGCGGCCT ATTTGACGCG GGCCGGTGTG GATTGCGAAC GGGTGAACAA GGTCTATGAA
GGCCGCCCCA ATGCCATCGA TCTGATCAAA AACGGCGAGA TCGATCTGGT CATCAATACC
TCCTCGGGCA AGAAAACGAT CCGGGACTCG TCTTCCCTGC GACAGACGAC CCTGCTCTAC
GGTATCCCGT ATACGACCAC AGTCGCCGGT GCCCGAGCCA TGGCCCAGGC CATTGCCGCT
TTGCAGGGGC ATGGACTGGA GGTGAAAAGC CTTCAGGAAT ACTACGGTAT GGAGTGA
 
Protein sequence
MPKRTDIRKI MLIGSGPIII GQACEFDYSG TQALKALKEE GYEVVLINSN PATIMTDPGL 
ADRTYVEPLE PESVAKIIAK ERPDALLPTL GGQTGLNTAV SVAESGVLEK YGVELIGASL
EVIQKAESRR QFRTAMENIG LTVPQSGIAS SLEEVREWTN TLSFPMIVRP AFTLGGTGGG
VAYNREDLEE IARQGLAASP SHEVMLEESV LGWKEYELEV MRDKHDNCVI ICSIENLDAM
GVHTGDSVTV APAQTLTDRE FQKMRDASLA IMREIGVETG GSNVQFAVNP ENGDMVVIEM
NPRVSRSSAL ASKATGFPIA KIAAKLAVGY TLDEIPNDIT RETMASFEPT IDYCVVKIPR
FTFEKFPGSQ DFLTTTMKSV GETMAIGRTF KESLQKGLRS LEVGMAGLGN GTGEDDLDAE
SLMALLKTPN SQRLFAIRDA LRQGMPLETL YQATFVDPWF LQQIKAIVDC EAELKHFSLE
QAISADNPHM EEVLRHAKEF GFSDLQLANL WKRTEDDIRQ LRIQMGITAT YKLVDTCAAE
FEAYTPYYYS SYERENENRA TGKRKVVILG GGPNRIGQGI EFDYCCVHAS YALRDMDIES
VMVNSNPETV STDYDTSDRL YFEPLTREDV LNIIEQEQPE GVIVQFGGQT PLNLAVPLLR
AGVPILGTTP DSIDRAEDRE RFHALLNKLG LKQPDNGLAT SEEGALDVAE RIGYPVVVRP
SYVLGGRAME IVYDSDELKN YFRDAVVASP DHPILIDKFL QGAVEIDVDA LCDGTQTYVG
GVMEHIEEAG VHSGDSACVL PPHTVSQETI DEIRRQSSAL AEELGVVGLM NVQFAVQNGE
IFILEVNPRA SRTVPFVSKA TGVPLAKLAT QVMMGRTLED LNPWSMRQWG HLAVKEAVMP
FNRFPGVDVL LGPEMRSTGE VMGVDTEFGL AFMKAQLGAG QKLPASGTVF ISVNDADKDA
VVEAARTFAR IGLRIVSTEG TAAYLTRAGV DCERVNKVYE GRPNAIDLIK NGEIDLVINT
SSGKKTIRDS SSLRQTTLLY GIPYTTTVAG ARAMAQAIAA LQGHGLEVKS LQEYYGME