Gene Francci3_1455 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1455 
Symbol 
ID3903187 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1745728 
End bp1746840 
Gene Length1113 bp 
Protein Length370 aa 
Translation table11 
GC content74% 
IMG OID637878792 
Productprephenate dehydrogenase 
Protein accessionYP_480561 
Protein GI86740161 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0287] Prephenate dehydrogenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00804434 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGGTCGGGT TGGAATGGGA TGCCGCACGC CTTCCGCGCC TGCGCCGGGT CGGGGTCGTC 
GGCACGGGCC TGATCGGGAC CAGCATCGGT CTCGCGCTGT CGGCAAGGGG CGTCGAGGTC
CTGCTCCGCG ACGTGGACGA GGCACAGGTC GCGCTTGCCG AGAAGATGGG GGCCGGGCGG
CCGTGGGCGG GGGAGCGGGT CGATCATGCG GTCGTCGCCA CCCCACTGCC GAGTGTCGCG
GTCCAGGTAC GCGCCCTGGC CCGCTCCGGC CTCGCGGACA CGATCAGCGA CGCCGGCAGC
GTCAAGGTCC GACCGTTGGT CGAGGGAGTC CAGCTCGGCT GCGACCTCAC GACGTGGTGC
CCGGCCCACC CCATCGCCGG CCGGGAACGG CACGGGGCCG TCTCCGCCCG GGCGGACCTG
TTCGCCGAGC GGGTCTGGGC GGTCTGTCCC GTCCCCCACA CCGGGTCGGC CGCGGTGCAC
GCCACGGTGG CGCTGGCGTT CGCCTGCGGG GCCACGCCGG TGCGGACCAC CCCGCAGCGT
CACGACGCGG CGATGGCGTC GGTCTCCCAC GTGCCCCAGA TCGTCGCGAG CGCGCTCGCG
GGGGCGCTCG TCGGGCTGCC GGAACGGGAC GTGCCCTTCG TCGGGCAGGG GTTCCGCGAC
ACCACCCGGC TCGCCGACAG CGACGCCGAG CTGTGGTCGG GGATCATCGA GGGTAACCGC
GGCCCGATCG CCGAGCGGGT GCGTTCCCTC GGCGCCCAGC TCACCGCGCT CGCCGACGTC
CTCGACACCG GTTCCGGTGA CGAGGTCACC GCCGCGGTCT CCCGGCTCAT GCGGGGCGGC
CAGGCCGGCC GTGCGCTGCT TCCGCGCAAG CCCGGCGCCC CGGCGCAGTC CTGGGGCTGG
GTGGGGGTCG TGCTCGACGA CCGACCCGGC CAGCTCGCCG CGCTCGTCGG CTTCATCAGC
CAGTGGCAGA TCAACATTGA GGATGTCGGG CCCTTCGAGC ACAGCCTTGA CGCACCCGCC
GGCATCGTCG AGCTCGCGGT GGATCCGACG GCCGCGGACG AGCTCGTCGA CCGGTTGACG
CTCAACGGCT GGACGGCATA CCGGCGATCC TGA
 
Protein sequence
MVGLEWDAAR LPRLRRVGVV GTGLIGTSIG LALSARGVEV LLRDVDEAQV ALAEKMGAGR 
PWAGERVDHA VVATPLPSVA VQVRALARSG LADTISDAGS VKVRPLVEGV QLGCDLTTWC
PAHPIAGRER HGAVSARADL FAERVWAVCP VPHTGSAAVH ATVALAFACG ATPVRTTPQR
HDAAMASVSH VPQIVASALA GALVGLPERD VPFVGQGFRD TTRLADSDAE LWSGIIEGNR
GPIAERVRSL GAQLTALADV LDTGSGDEVT AAVSRLMRGG QAGRALLPRK PGAPAQSWGW
VGVVLDDRPG QLAALVGFIS QWQINIEDVG PFEHSLDAPA GIVELAVDPT AADELVDRLT
LNGWTAYRRS