Gene Tery_2877 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_2877 
SymbolcarB 
ID4244948 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp4479979 
End bp4483209 
Gene Length3231 bp 
Protein Length1076 aa 
Translation table11 
GC content41% 
IMG OID638107926 
Productcarbamoyl phosphate synthase large subunit 
Protein accessionYP_722523 
Protein GI113476462 
COG category[E] Amino acid transport and metabolism
[F] Nucleotide transport and metabolism 
COG ID[COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) 
TIGRFAM ID[TIGR01369] carbamoyl-phosphate synthase, large subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.1435 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCACGAC GCCAAGATAT ACGTAAAATC CTACTAGTTG GTTCTGGCCC CATTGTTATT 
GGCCAGGCCT GTGAATTTGA TTATAGTGGA ACTCAAGCTT GTAAAGCCCT TAGGGAAGAG
GGTTTTGAGG TAGTATTGGT AAATTCTAAT CCGGCTACAA TTATGACAGA TCCAGAAATT
GCCGAATGTA CCTACATTGA ACCTCTAACC CCGGAACTTA TAGAAAAAGT TATTGCTAAA
GAAAGACCAG ATGCTTTATT ACCAACAATG GGTGGTCAAA CTGCTTTAAA TATAGCAGTT
TCTCTGGCAA AAAATGGCAC TTTAGCTAAA TATGGTGTGG ATTTAATTGG GGCTAAACTA
CCAGCCATTG AAATGGCCGA AGATCGTCAA TTGTTTAAGG AAGCAATGGG TCGGATAGGC
ATCCAGGTTT GTCCATCAGG TATTGCTAGT AATTGGGAAG AGGCCAAGGC GATCGCTCAA
AAAATAGGAA CATATCCTCT AATTATTCGT CCTGCTTTTA CTTTAGGAGG GACAGGGGGA
GGTATTGCTT ATAACCAAGA AGAATATGAG GTAATGGCAC AAGGGGGGAT AGATGCTAGT
CCAGTATCTC AAATTTTGGT GGAAAAATCT CTAATTGGTT GGAAGGAATA CGAACTGGAA
GTCATGCGAG ATTTAGCAGA CAATGTGGTG ATTATTTGCT CAATTGAAAA TATTGACCCA
ATGGGGATAC ATACGGGAGA CTCTGTTACT GTTGCTCCGG CGCAAACATT GACGGATAAG
GAATATCAAA GACTACGAGA TGCTTCGATT AAAATTATTC GGGAAATAGG TGTGGAAACT
GGAGGGTCAA ATATTCAGTT TGCTGTTCAC CCGTTGAATG GGGATGTTAT TGTGATTGAA
ATGAACCCAC GGGTGAGTCG TTCTTCTGCT TTGGCGTCAA AAGCAACTGG TTTCCCTATA
GCGAAAATGG CGGCTAAGTT AGCAGTTGGT TATACTCTTG ATGAAATTCC AAATGATATT
ACGAAAAAGA CTCCAGCTAG TTTTGAACCT ACTATAGATT ATGTAGTTAC GAAAATTCCT
AGGTTTGCTT TTGAGAAGTT TCCCGGTTCC CAACCAGTGT TAACTACTCA GATGAAGTCG
GTTGGGGAGG CAATGGCAAT AGGTCGCACG TTTCAGGAGT CGTTCCAGAA AGCTTTACGG
TCTTTGGAAA CTGGTAGAGC GGGTTTTGGT GCTGATCGCT GGGAAAAAAT ACCCAGTTTA
GAACAAGTAA GATCGGGGTT ACGCACGCCT AACCCAGAAA GATTATTTAC AGTTCGTCAT
GCTTTATTAC TGGGAATGAG TTTAGAAGAA ATCTATGAGT TGACTGGAAT AGATATTTGG
TTTCTTGATA AGATGCGGGA GTTGATAAAT ACAGAAAAGT TAATTAAGTA TACTCCGCTC
AATGAATTAA CAAAGGAGCA GCTCTGGGAT ATTAAACGTC AAGGTTTTAG CGATCGCCAA
ATTGCTTATT GCAGTAAAAC TACCGAAGAT GAGGTCAGAA GCTATCGGTT GAGTTTGGGC
ATTAAACCTG TTTATAAAAC AGTGGATACT TGTGCTGCAG AGTTTGAGGC ACTGACACCT
TACTACTATT CTACTTATGA AGAAGAGTCA GAAATTTTAC CATCGGAAAA ACGGAAAGTA
ATGATTCTTG GAGGAGGACC TAACCGTATT GGTCAGGGAA TAGAATTTGA CTATTGTTGC
TGTCATGCTA GTTATGCTTT GGGAGCGGAT GGGTTTGAAA CAATAATGGT CAACTCTAAC
CCGGAAACTG TTTCTACTGA CTATGATACA AGCGATCGCC TCTATTTTGA ACCTTTGACT
AAAGAGGATG TACTGAATAT TATTGAAGCG GAAAACCCGG TAGGAATTAT TGTTCAGTTT
GGCGGACAAA CACCTCTGAA ATTAGCGGTT CCTCTGCAAA AATATTTAGA GCAAGTAGAT
GCTCAAACCA AAATATGGGG TACTTCGCCA GACTCTATTG ATACTGCTGA AGACCGGGAA
CGGTTTGAGA AGATTTTACG AGATTTAGAT ATTCGTCAAG CAGCAAATGG TCTGGCTCGT
AGCTTTGAGG ATGCTTTGAC AGTAGCACAT ACAATTGGTT ATCCAGTTGT GGTAAGACCT
TCCTATGTTT TGGGAGGTAG GGCTATGGAA ATTGTTTATT CTGACCAAGA GCTGGAACGT
TATATGAAGT TTGCAGTACA GGTGGAACCG GAACATCCAA TTTTAATTGA CAAATTTTTG
AACAATGCTG TTGAGGTAGA TGTGGATGCT ATTGCTGATA GCACAGGTCG GGTAGTTATC
GGTGGTATCA TGGAACATAT TGAGGAGGCA GGTATTCACT CTGGTGACTC TGCTTGTTCT
ATTCCTTATC AAACGTTAAC TTCTGCTGCG GTGGAAACAA TTCGTCAATG GACTGTTGCT
CTGGCAAAGG CGTTGAATGT GATCGGGTTA ATGAATATTC AGTTTGCAGT CCAAGGAGAA
ACAGTATATA TTTTGGAAGC TAACCCCCGC GCATCAAGGA CAGTACCTTT TGTCTCTAAG
GCAATAGGAA TACCTTTGGC AAAAATAGCA TCCCGGGTTA TGTCTGGGAA AAATTTGGAG
GAATTGGGTT TTATAGATGA ACAAATACCT GATCATGTTT CTGTGAAAGA GGCAGTGTTA
CCATTTGCTA AATTCCCTGG TATAGATACA GTATTAGGAC CTGAAATGCG CTCGACAGGT
GAGGCTATGG GTATTGATGT TGACTTTGGT AAAGCTTTTG CTAAGGCACA GTTATCTGCA
GGTCAAAAGT TGCCTTTAGA GGGAACAGTG TTTGTGTCAA TGAGCGATCG CTATAAAGAA
GCAGTTGTAC CAGTGGTTAA AGATTTAGTT GATTTGGGTT TGAAGGTAGT AGCGACTGAA
GGAACTCGGA AGATTTTGCG TTTGCATGAT TTAGAAGTGG GTTTGATGCT GAAGTTGCAT
GAAGGGCGAC CTAATGTTTT GGATGGAATT AAAAATGGAG AAATTCAGTT GATTATTATA
ATTCCTTCTG GGGATGAAGC CCGTGCAGAT GGGATTAAAA TTCGGCGGAG TGCGTTGGAT
TATAAAATAA CGTTAATTAC AACTATTGCG GGGGCGAAGG CGACGGCGGC CGCAATTCGA
GCATTGAAGT CGGGAGCGTT GGAGGTGAAG GCGATACAGG ATTATTATTA G
 
Protein sequence
MPRRQDIRKI LLVGSGPIVI GQACEFDYSG TQACKALREE GFEVVLVNSN PATIMTDPEI 
AECTYIEPLT PELIEKVIAK ERPDALLPTM GGQTALNIAV SLAKNGTLAK YGVDLIGAKL
PAIEMAEDRQ LFKEAMGRIG IQVCPSGIAS NWEEAKAIAQ KIGTYPLIIR PAFTLGGTGG
GIAYNQEEYE VMAQGGIDAS PVSQILVEKS LIGWKEYELE VMRDLADNVV IICSIENIDP
MGIHTGDSVT VAPAQTLTDK EYQRLRDASI KIIREIGVET GGSNIQFAVH PLNGDVIVIE
MNPRVSRSSA LASKATGFPI AKMAAKLAVG YTLDEIPNDI TKKTPASFEP TIDYVVTKIP
RFAFEKFPGS QPVLTTQMKS VGEAMAIGRT FQESFQKALR SLETGRAGFG ADRWEKIPSL
EQVRSGLRTP NPERLFTVRH ALLLGMSLEE IYELTGIDIW FLDKMRELIN TEKLIKYTPL
NELTKEQLWD IKRQGFSDRQ IAYCSKTTED EVRSYRLSLG IKPVYKTVDT CAAEFEALTP
YYYSTYEEES EILPSEKRKV MILGGGPNRI GQGIEFDYCC CHASYALGAD GFETIMVNSN
PETVSTDYDT SDRLYFEPLT KEDVLNIIEA ENPVGIIVQF GGQTPLKLAV PLQKYLEQVD
AQTKIWGTSP DSIDTAEDRE RFEKILRDLD IRQAANGLAR SFEDALTVAH TIGYPVVVRP
SYVLGGRAME IVYSDQELER YMKFAVQVEP EHPILIDKFL NNAVEVDVDA IADSTGRVVI
GGIMEHIEEA GIHSGDSACS IPYQTLTSAA VETIRQWTVA LAKALNVIGL MNIQFAVQGE
TVYILEANPR ASRTVPFVSK AIGIPLAKIA SRVMSGKNLE ELGFIDEQIP DHVSVKEAVL
PFAKFPGIDT VLGPEMRSTG EAMGIDVDFG KAFAKAQLSA GQKLPLEGTV FVSMSDRYKE
AVVPVVKDLV DLGLKVVATE GTRKILRLHD LEVGLMLKLH EGRPNVLDGI KNGEIQLIII
IPSGDEARAD GIKIRRSALD YKITLITTIA GAKATAAAIR ALKSGALEVK AIQDYY