Gene Franean1_3949 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3949 
Symbol 
ID5672310 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4721200 
End bp4722546 
Gene Length1347 bp 
Protein Length448 aa 
Translation table11 
GC content73% 
IMG OID641242828 
Productcarbamoyl-phosphate synthase L chain ATP-binding 
Protein accessionYP_001508245 
Protein GI158315737 
COG category[I] Lipid transport and metabolism 
COG ID[COG4770] Acetyl/propionyl-CoA carboxylase, alpha subunit 
TIGRFAM ID[TIGR00514] acetyl-CoA carboxylase, biotin carboxylase subunit 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.866278 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0162238 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATCAAGA AGGTCCTGAT CGCGAACCGC GGGGAGATCG CGCTGCGGGT CGCGCGCACC 
TGCCGCGAGC TGGGCATCGC CACCGTGGCC GCCCACTCGT CGGCGGACCG GGACTCCGCC
GTCGTCTCCT TCGCCGACGA GTCCGTCCAG ATCGGCCCGG GGCCGTCCCG GGACAGCTAC
CTCAACATCC CGGCGGTCGT CGAGGCCGCC CGTATCCGCG GCGCCGACGC CGTCCACCCC
GGGTACGGGT TCCTCTCCGA GAATCCCGAC TTCGCCGAGA TCTGCGCGGC CGAGGGCCTG
ACGTTCATCG GCCCGCCGGC CGAGGTGATG GCCCGCCTCG GTGACAAGGC TGTCGCCCGT
CGGATCATGT CCGAGGCCGG GCTGCCCCTG CTGCCCGGCA GCCGGGACAC TCCCGACCAC
CCCGAACAGG CCGCGCAGGT GGCGGCCGAG ATCGGCTACC CGGTCATCAT CAAGGCGGCG
GCCGGCGGCG GCGGGCGCGG GATGAGCGTC GTGCACGATC CGGACGGCTT CGAGCGCGCC
TACCGGCACA CCCGCTCCAC CGCCCAGGCG GTGTTCGGGG ACGGCCGCCT GTACGTCGAG
CGTTACCTGG CCTCGGCACG GCACGTCGAG GTCCAGGTGC TCGCCGATGC GCACGGCGCG
GCCGTCCACC TGGGCGCCCG GGACTGCTCG CTGCAGCGCC GGCACCAGAA GCTGGTCGAG
GAGACTCCGG CGCCGGCCCT GCCAGCCGAG ATCGTCGAGC CGCTGTGTGC GGCCGCCGTC
CGCGGCACCC TGGCCAGCGG GTATGTCGGC GCGGGCACCT TCGAGTTCCT CGTCGACGGC
GAGGGCCGGT TCTACTTCAT GGAGGTCAAC TGCCGGCTGC AGGTGGAGCA CCCGGTCACC
GAGATGGTCA CCGGGCTCGA CCTGGTCGCC GAGCAGATCC GGATCGCCGC GGGCGAGCCG
CTGGGCTACG GCCAGGACGA CGTCGACCCG CGCGGGGTGT CGATCGAGTG CCGGATCAAC
GCCGAGGACC CGCGTCGTGA CTTCGCCCCC GCGCCGGGTT CGCTCACCGA GTTCACCGTC
CCGGCCGGCC CGTTCGTCCG CGTCGACACG CACGCCGCCC CCGGGTACCG CATCCCGGCC
TACTACGACT CACTGGTCGC GAAAGTGGTT GTCTGGGCCC CGGACCGCCC CCGGGCCCTG
GCCCGGATGC GCCGCGCCCT CGACGAGCTG CACGCCGACG GGCCCGGCGT CGTCACCACC
GCCGGCTTCC TGCGCGAACT GCTCGACCAC CCCCGGTTCA TCGCCGCCGA ACACGACACC
GTCCTGATCG AGTCCATGAC CCACTAA
 
Protein sequence
MIKKVLIANR GEIALRVART CRELGIATVA AHSSADRDSA VVSFADESVQ IGPGPSRDSY 
LNIPAVVEAA RIRGADAVHP GYGFLSENPD FAEICAAEGL TFIGPPAEVM ARLGDKAVAR
RIMSEAGLPL LPGSRDTPDH PEQAAQVAAE IGYPVIIKAA AGGGGRGMSV VHDPDGFERA
YRHTRSTAQA VFGDGRLYVE RYLASARHVE VQVLADAHGA AVHLGARDCS LQRRHQKLVE
ETPAPALPAE IVEPLCAAAV RGTLASGYVG AGTFEFLVDG EGRFYFMEVN CRLQVEHPVT
EMVTGLDLVA EQIRIAAGEP LGYGQDDVDP RGVSIECRIN AEDPRRDFAP APGSLTEFTV
PAGPFVRVDT HAAPGYRIPA YYDSLVAKVV VWAPDRPRAL ARMRRALDEL HADGPGVVTT
AGFLRELLDH PRFIAAEHDT VLIESMTH