Gene Francci3_3198 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3198 
SymbolcarB 
ID3906164 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp3788756 
End bp3792148 
Gene Length3393 bp 
Protein Length1130 aa 
Translation table11 
GC content70% 
IMG OID637880522 
Productcarbamoyl phosphate synthase large subunit 
Protein accessionYP_482284 
Protein GI86741884 
COG category[E] Amino acid transport and metabolism
[F] Nucleotide transport and metabolism 
COG ID[COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) 
TIGRFAM ID[TIGR01369] carbamoyl-phosphate synthase, large subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00397433 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGAAGC ATGAGGATCT GAGCAGCGTC CTGGTCCTCG GCTCCGGGCC GATCGTGATC 
GGCCAGGCGT GCGAGTTCGA CTACTCGGGT ACCCAGGCCT GCCGGGTGCT GCGCGCGGAG
GGCCTGCGGG TCATCCTGGT CAACAGCAAC CCGGCGACAA TCATGACCGA TCCGGAGATC
GCGGACGCGA CCTACGTCGA GCCGATCACC TCGGACATCG TCGCGAAGAT CATCGAACGG
GAGCGGCCCG ACGCGATCCT GGCCACCATG GGTGGCCAGA CCGCGCTGAA CACCGCCGTC
GCCCTGCACG ACGCCGGTGT CCTGGACCGC TTCGAGGTCC GTCTGATCGG CGCGAATATC
GACGCGATCC GCGCCGGGGA GGACCGTCAG GCGTTCAAGG ACATCGTCGC CGCGGTCGGC
GGCGAGACCG CGCGCAGCGC CATCTGCCAC ACCGTCGCCG AGTGCCTGAC CGCCGGGGAG
GAGTTCTCCT ACCCGGTGGT CGTGCGACCC AGCTTCACCC TCGGCGGCGC CGGCAGCGGC
TTCGCCCATG ATTCCGGCGA GTTGCGCCGG ATGGCGGCGG ACGGACTGGC CGCGAGCCCG
TCGACCGAGG TGCTGGTGGA GGAGTCGGTC CTCGGCTGGA AGGAGTACGA GCTCGAGCTG
ATGCGCGACC GCGCGGACAA CGTGGTCGTG GTCTGCTCGA TCGAGAACGT CGACCCGATG
GGGGTGCACA CCGGCGACTC GATCACGGTG GCGCCGGCGA TGACCCTGAC CGACCGCGAG
TACCAGCGCA TGCGGGACAT GGCGATCGCG GTGATGCGGG CCGTCGGCGT CGACGCCGGC
GGCTGCAACA TCCAGTTCGC CGTCGACCCT GCGACGGGCC GGCAGGTGGT CATCGAGATG
AACCCCCGGG TGTCGCGGTC CTCGGCGCTG GCGTCGAAGG CGACCGGCTT CCCGATCGCG
AAGATCTCGG CGAAGCTCGC GCTCGGATAC ACCCTTGACG AGATCCCCAA CGACATCACC
CGCACCACGC CCGCGGCCTT CGAGCCGGCG CTGGACTACG TCGTGGTGAA GGTGCCGCGG
TTCGCGTTCG AGAAGTTCCC CGGTGCCGAC CCGACGCTCA CGACGACGAT GAAGTCCGTC
GGCGAGGCGA TGGGGGTTGG CCGCAGCTTC GCCGAGGCGC TGCAGAAGGC GCTGCGCTCG
ATGGAGGCCC CGGGCTCGGT GTTCTCCTTC GTCCCGCCCG AGGCGGACGC CGCCGACCTG
CTGGAAGCCG CGCGGGTTCC CCACGACGGC CGGTTGCGTA CCGTCCAGCA GGCGCTGCTT
GCCGGCGCGG ATCCCGACGA GGTCACCCGG GTCACCGGGA TCGACCCGTG GTTCGTCGAC
CAGCTCGTCT TCCTCAACGA GACCGCGGCG ATGATCAGCC GCAATCCGGC GGGCATCCGG
CGGGCCAAGC GAGCCGGGTT CTCCGACGTC CAGCTCGCCG AGATCCTCGG CACCTCCGAG
GACGTGGTCC GGGCGTTCCG CCACCGCACC GGCATCCGGC CGGTGTTCAA GACCGTCGAC
ACCTGCGCGG CGGAGTTCGC CTCCGAGACC CCGTACCACT ACTCCGCCTA CGACGCCGAG
ACCGAGGTCG CGCCGAGCAA ACGGCCGCGG GTGATCGTGC TCGGCAGCGG GCCCAACCGC
ATCGGCCAGG GCATCGAGTT CGACTACGCC TGTTGTCATG CGGTGATGGC GCTTTCCGAC
GCCGGCTATG AGACCGTCAT GGTCAACTGC AACCCGGAGA CGGTGTCCAC CGATTACGAC
ACCGCCGACC GGCTCTACGT CGAGCCGTTG ACGGTCGAGG ACGTGCTTGA GGTCGTCCAC
GCCGAGCAGC AGGCCGGGCC GCTCGCCGGG GTAATCGTCC AGCTCGGCGG GCAGACCCCC
CTCGGCATCG CCGCGGCGCT CGCCGAGGCG GGCGTGCCCG TCGTGGGCAC CCCGCCCAAG
GCGATCCATC TGGCCGAGGA CCGTGGGCTG TTCGGGCGTG TGCTGGCCCG GGCGGGCCTG
CCCTCGCCGC CGCACGGGGT GGCGACCTCC TTCGCCGAGG CGCGCACGGT GGCCGAGCGG
ATCGGGTACC CGGTGCTGGT GCGGCCGTCC TACGTGCTCG GCGGGCGTGG CATGGAGATC
GTCTACGACG ACACGATGCT GCGCGACTAC ATCGATCGGG CCACCGCCAT CTCCCCGGAA
CATCCGGTGC TCGTCGACCG GTTCCTCGAC GACGCGGTGG AGATCGACGT TGACGCCCTC
TACGACGGGG AGACGCTCTA CCTCGCCGGG GTGATGGAGC ACATCGAGGA GGCCGGGGTC
CACTCGGGCG ACTCGGCCTG CGCCCTGCCG CCGATCACGC TGGGCCGTTC GGAGCTCGAC
CGTATCCGGA CGTCCACCGA GGCGATCGCG AAGGGCGTCG GGGTGCGGGG TCTGCTCAAC
GTCCAGTACG CCCTGCAGTC CGATGTGCTC TACGTCCTGG AGGCCAATCC CCGGGCGTCG
CGGACCGTGC CGTTCGTCTC CAAGGCCACC ACGGTGCCGC TGGCCAAGGC CGCCGCCCGG
GTGATGCTCG GCGCGACCAT CGACGAGCTG CGGGCCGAGG GCCTGCTGCC GCGCTCCGGC
GACGGTGGCA CCCTGCCGCT GGACTCGCAC ATCTCGGTCA AGGAGGCCGT GCTGCCCTTC
GGGCGCTTCC GTGACGCCGG TGGCCGCGGC GTCGACACCG TCCTCGGGCC GGAGATGAAG
TCGACCGGTG AGGTGATGGG CATCGACGAC GGCTTCGGCA CGGCGTACGC CAAGTCCCAG
GCCGCCGCGT ACGCCTCGCT GCCCACCTAC GGCCGGGTGT TCGTCTCCGT GGCCAACCGG
GACAAACGGG CGATGGTTTT TCCGATCAAG CGGCTCGCGG ATCTCGGTTT CGTCGTCTAC
GCCACCGAGG GGACCGCGGA CGTGTTGCGG CGCAACGGGG TCAAGGCGGT CGTCCTCGGC
AAGCACTGGG CCCCCACCGA GGGGCTACTC GACTGTGTCG AGATGATCAC ATCCGGGCAG
ATCGACCTCG TCATCAACAC CCCGTGGGGC GTCGGCCCGC GCCTGGACGG CTATGAGATC
CGCACAGCCT GCGTGAGTGC CGGGGTTCCG TGCATCACCA CGATCCAGGG TGCGGCGGCC
TGCGTGCAGG GCGTGGAAGC GCTGGTACGG GGGGAGCTGG GGGTCCGATC GTTGCAGGAA
TACCACGCGG CGCTGCGGCA GGCGTGGGGC GGGGGACAAC CGGGCGGCTC GCCGCCCGAA
TCCTCCGGCT CTGCGGCGTC TGAGTCCGCG GCGTCTGAGT CCGCGGCGCC TGAGTCCGCC
GTGTTCGGAC CGGCGGCCAG GAGGAGCGGA TGA
 
Protein sequence
MPKHEDLSSV LVLGSGPIVI GQACEFDYSG TQACRVLRAE GLRVILVNSN PATIMTDPEI 
ADATYVEPIT SDIVAKIIER ERPDAILATM GGQTALNTAV ALHDAGVLDR FEVRLIGANI
DAIRAGEDRQ AFKDIVAAVG GETARSAICH TVAECLTAGE EFSYPVVVRP SFTLGGAGSG
FAHDSGELRR MAADGLAASP STEVLVEESV LGWKEYELEL MRDRADNVVV VCSIENVDPM
GVHTGDSITV APAMTLTDRE YQRMRDMAIA VMRAVGVDAG GCNIQFAVDP ATGRQVVIEM
NPRVSRSSAL ASKATGFPIA KISAKLALGY TLDEIPNDIT RTTPAAFEPA LDYVVVKVPR
FAFEKFPGAD PTLTTTMKSV GEAMGVGRSF AEALQKALRS MEAPGSVFSF VPPEADAADL
LEAARVPHDG RLRTVQQALL AGADPDEVTR VTGIDPWFVD QLVFLNETAA MISRNPAGIR
RAKRAGFSDV QLAEILGTSE DVVRAFRHRT GIRPVFKTVD TCAAEFASET PYHYSAYDAE
TEVAPSKRPR VIVLGSGPNR IGQGIEFDYA CCHAVMALSD AGYETVMVNC NPETVSTDYD
TADRLYVEPL TVEDVLEVVH AEQQAGPLAG VIVQLGGQTP LGIAAALAEA GVPVVGTPPK
AIHLAEDRGL FGRVLARAGL PSPPHGVATS FAEARTVAER IGYPVLVRPS YVLGGRGMEI
VYDDTMLRDY IDRATAISPE HPVLVDRFLD DAVEIDVDAL YDGETLYLAG VMEHIEEAGV
HSGDSACALP PITLGRSELD RIRTSTEAIA KGVGVRGLLN VQYALQSDVL YVLEANPRAS
RTVPFVSKAT TVPLAKAAAR VMLGATIDEL RAEGLLPRSG DGGTLPLDSH ISVKEAVLPF
GRFRDAGGRG VDTVLGPEMK STGEVMGIDD GFGTAYAKSQ AAAYASLPTY GRVFVSVANR
DKRAMVFPIK RLADLGFVVY ATEGTADVLR RNGVKAVVLG KHWAPTEGLL DCVEMITSGQ
IDLVINTPWG VGPRLDGYEI RTACVSAGVP CITTIQGAAA CVQGVEALVR GELGVRSLQE
YHAALRQAWG GGQPGGSPPE SSGSAASESA ASESAAPESA VFGPAARRSG