Gene Franean1_4049 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4049 
Symbol 
ID5672407 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4827989 
End bp4829071 
Gene Length1083 bp 
Protein Length360 aa 
Translation table11 
GC content76% 
IMG OID641242925 
Productpyruvate carboxyltransferase 
Protein accessionYP_001508342 
Protein GI158315834 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0119] Isopropylmalate/homocitrate/citramalate synthases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.651223 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.299101 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCCAGC CTGATCCGCG CCGGCCCGAC CCGTACCGGC CCGACGCGCG CCAGCCAGGT 
GGGCTCCAGC CGGACGTCCA CCGGGCGGAC GGCCTGCCCG AGTCGGTGAC GATCTACGAG
GTGGGCCCTC GCGACGGGCT GCAGAACGAG GCCTCGGTCG TCGACGTGGC GGTCAAGGCG
GAGTTCGTCC GGCGGCTCGC CGGCGCCGGG CTGCGGGCGA TCGAGACGAC GAGCTTCGTC
CCGGCGGCCT GGGTTCCGCA GCTCGCCGAC GCGGCGGAGC TGCTGGAGCT GCTGGGCCCG
CCGCCGCGGG GCGCCCGGCG GCCGGTGCTG GTCCCGAACA CGCGGGGGCT GGAACGGGCC
CTGGCGGCCG GGGCCGGCGA GATCGCGGTT TTCGGCAGCG CCACCGAGAC CTTCGCGCGG
CGCAACCTCA ACCGCACGGT CGACGAGTCG CTGGCGATGT TCGAGCCGGT GGTGACGGCG
GCCCGGGCCC GGGGCCTGGC GGTCCGCGGC TACCTGTCGA TGTGCTTCGG CGACCCGTGG
GAGGGCGCCG TCCCGCCCGG GCAGGTCGCC GCGATCGCCC GGCGCCTGGT CGACCTCGGT
GTTGACGAAC TGTCGCTGGG CGACACCATT GGAGTGGCCA CCCCCGGCCA CGTCGAGGCC
CTGCTCGCCG AGCTGGCCGC GGCCGGGATC GGCCCGGGGA CCCTCGCGGT GCACTTCCAC
GACACCTACG GCCAGGCTCT CGCCAACACC CTTGCCGCGC TGCGCGCCGG GGTGCGCACG
GTCGACTCGT CCGCGGGCGG CCTCGGCGGC TGTCCGTACG CGAGGAGCGC GACCGGCAAC
CTCGGCACCG AGGACCTCGT CTGGATGCTG CACGGCCTCG GCGTCGGCAC CGGCGTCGAC
CTCGGGGCGC TGGTGCGCAC CAGCGTGTGG ATGGGGGAGC GGCTGGGCCG GCCGAGCCCG
TCACGAGTCG TGCACGCGCT GGCAGCGCGG CTCGCCGAGG ACGGTCTCGC CGAGGACGGT
GCCACGGCGG ACGGATGTAG TGAAGCAGGA GAAGGAGACG GGCCATGCCT GACAGCCGGC
TGA
 
Protein sequence
MPQPDPRRPD PYRPDARQPG GLQPDVHRAD GLPESVTIYE VGPRDGLQNE ASVVDVAVKA 
EFVRRLAGAG LRAIETTSFV PAAWVPQLAD AAELLELLGP PPRGARRPVL VPNTRGLERA
LAAGAGEIAV FGSATETFAR RNLNRTVDES LAMFEPVVTA ARARGLAVRG YLSMCFGDPW
EGAVPPGQVA AIARRLVDLG VDELSLGDTI GVATPGHVEA LLAELAAAGI GPGTLAVHFH
DTYGQALANT LAALRAGVRT VDSSAGGLGG CPYARSATGN LGTEDLVWML HGLGVGTGVD
LGALVRTSVW MGERLGRPSP SRVVHALAAR LAEDGLAEDG ATADGCSEAG EGDGPCLTAG