Gene Francci3_0698 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_0698 
Symbol 
ID3906248 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp798750 
End bp800507 
Gene Length1758 bp 
Protein Length585 aa 
Translation table11 
GC content67% 
IMG OID637878031 
Productcarbamoyl-phosphate synthase L chain, ATP-binding 
Protein accessionYP_479811 
Protein GI86739411 
COG category[I] Lipid transport and metabolism 
COG ID[COG4770] Acetyl/propionyl-CoA carboxylase, alpha subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.933214 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGTAAGA TCCTCATCGC CAATCGGGGA GAAATCGCGG TTCGAGTGGC CCGCGCCTGC 
CGCGACGCCG GATACACCAG CGTCGCAGTC TACGCGGAAC CAGATATCAA TGCCCTGCAC
GTACGCGTTG CTGACGAGGC ATTTGCGTTG GGAGGCGCGA CCCCTGGTGA TTCGTACCTG
CGCATCGACA AAATTCTCGA TGCCTGCGAG TCCTCCGGGG CCGACGCCGT CCACCCGGGA
TACGGTTTTC TCTCGGAGAA TGCGGACTTC GCGGAGGCCG TGATCTCCGC AGGCCTCACC
TGGATCGGCC CGCCGCCGGA GGCCATCCGC CGTCTCGGAG ACAAGACCGC GGCCCGGCAT
ATTGCACTAG CGGTCGGCGC GCCGCTCGCG CCCGGCACCG CAGACCCCGT GGCCGGCGTC
GATGAGGTTG TCACCTTCGC CGGGCAGTAC GGCCTTCCGG TTGCCATCAA GGCCGCTTTT
GGTGGCGGAG GGCGCGGACT GAAGGTCGCT TGGACCGCGG AGGAGCTCCC CGAACTGTTT
GACAGTGCCG TCCGGGAGGC GGTCGCGGCG TTCGGTCGCG GGGAATGCTT CGTCGAGCGG
TATCTGGACC AGCCGCGCCA CGTCGAGACC CAGATCCTGG CGGATACGCA CGGCAACGTC
GTCGTGGTCG GTACCCGCGA CTGCTCACTC CAGCGCCGTT ACCAGAAGCT CGTGGAGGAG
GCGCCGGCAC CGTTTCTCTC GGAGGCTCAG CTCCACTCGC TGTACGACGC CTCGAGGGCG
ATCTGCAAGA AAGCCGGCTA TGTCGGGGCC GGGACCGTCG AGTTCCTCGT CGCCCGCGAT
GGCATGATCA GCTTCCTTGA AGTCAACACC CGCCTCCAGG TCGAGCATCC GGTGACCGAA
GAGGTCTCCG GAGTGGACCT GGTTCGTGCC CAGTTCCGTA TCGCCGAGGG TGAGGCACTC
GACTTCACCG ATCCGGCTCC TCGCGGTCAC TCCATCGAAT TCCGCATCAA CGGTGAGGAT
CCCGGCCGTA ACTTCCTTCC CGCACCCGGC ACGGTCACCA AGCTGATCGC GCCGGCCGGC
CCCGGCGTCC GGCTCGATAC CGGCATCGAA AGCGGCAGCG TCATCGGCGG CGCCTTCGAC
TCCCTGCTCG CAAAGCTCAT CATCACCGGA GCAACCCGGC AGGAAGCTCT GCAGCGGGCT
CGGCGTGCCC TCGATGAGAT GGTCATCGAG GGAATGGCGA CGGCGCTGCC GTTCCACCGC
GCGGTCGTAC GGGATCCCGC CTTCGCCCCG GAGGACGCCG CAGAGCCTTT CCGTATCCAC
AACCGCTGGA TCGAAACCGA GTTCGACAAC ACCATCCCGG CCTTCGCGGG AGGGACTGAG
GCCGACACCA CCCCCGAGCC GCGGGAGACC GTGGTTGTCG AGGTGAGCGG CAAACGCCTC
GAAGTCGTGC TACCCGCCGG ATTCGGTTCT GCCCCCGTCG CCGCTTCCCG GGGAGCGGCG
CCCAAGCGGC GCGCTCGCGG CACGGCGGCG GCCAGGGTCA CGGGCGACGC CCTGGCCAGT
CCCATGCAGG GCACCATCGT GAAGGTCGCG GTCAACGACG GCGACACCGT TTCCGCGGGT
GATCTGATCA TTGTTCTGGA AGCGATGAAG ATGGAACAGC CGATCAACGC GCATCGGGAT
GGCACGGTCA CCGGACTGGC TGCGACCGTC GGCGCCGTTG TCACCAGCGG CACCGTGCTC
TGTGAGATCA AAGACTAG
 
Protein sequence
MRKILIANRG EIAVRVARAC RDAGYTSVAV YAEPDINALH VRVADEAFAL GGATPGDSYL 
RIDKILDACE SSGADAVHPG YGFLSENADF AEAVISAGLT WIGPPPEAIR RLGDKTAARH
IALAVGAPLA PGTADPVAGV DEVVTFAGQY GLPVAIKAAF GGGGRGLKVA WTAEELPELF
DSAVREAVAA FGRGECFVER YLDQPRHVET QILADTHGNV VVVGTRDCSL QRRYQKLVEE
APAPFLSEAQ LHSLYDASRA ICKKAGYVGA GTVEFLVARD GMISFLEVNT RLQVEHPVTE
EVSGVDLVRA QFRIAEGEAL DFTDPAPRGH SIEFRINGED PGRNFLPAPG TVTKLIAPAG
PGVRLDTGIE SGSVIGGAFD SLLAKLIITG ATRQEALQRA RRALDEMVIE GMATALPFHR
AVVRDPAFAP EDAAEPFRIH NRWIETEFDN TIPAFAGGTE ADTTPEPRET VVVEVSGKRL
EVVLPAGFGS APVAASRGAA PKRRARGTAA ARVTGDALAS PMQGTIVKVA VNDGDTVSAG
DLIIVLEAMK MEQPINAHRD GTVTGLAATV GAVVTSGTVL CEIKD