Gene Franean1_4210 details

Gene Information

Locus tag: Franean1_4210
Symbol:
ID: 5672565
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 5014241
End bp: 5015182
Gene Length: 942 bp
Protein Length: 313 aa
Translation table: 11
GC content: 76%
IMG OID: 641243083
Product: hypothetical protein
Protein accession: YP_001508500
Protein GI: 158315992
COG category: [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism
COG ID: [COG0115] Branched-chain amino acid aminotransferase/4-amino-4-deoxychorismate lyase
TIGRFAM ID:
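
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it is the reading that reproduces the stated 942 bp) and that the final codon is a stop codon that does not appear in the protein. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 5014241
end_bp = 5015182

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is the stop codon,
# which is not represented in the protein sequence.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # 942 bp
print(protein_length, "aa")  # 313 aa
```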


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.672369
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.890959
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGGTTCCG TGGACACGAC ACCGCTGACG CACGTCGAGG TCGACGGCCA CCCCGCGGGC 
GTGGACGACC TGCGCCACCT CGCCCTGGCC AACGACGGCC ACCTCACGAC GATGCAGGTC
CGGGCCGGCC GGGTCCGGGG GCTGGCACTG CACCTGCGTC GGCTGGACCG CGCCAACCAG
GAGCTCTACG GCACCTACCT GGACGGCGGC CTGGTCCGCG ACCGGATCCG GCACGCGCTG
GCGACCACCC CGGCCCCGCA CGGCCCGGCG GAGAGCGGCG ACGCGACCGT GCGGACGGTG
GTCTTTCCCA CCGGAACGGG GTCGGTCTCG ATCCTGGTCT CGATCGGCCC GCCGGCCGAG
CCCGCCGCCG GCCCGCTGCG GCTGTGCTCG CTGCGCTACC AGCGGCCATT CCCGCACATC
AAGCACGTCG GCAGCTTCGG GCAGATCCAC TTCGGCCGGC TGGCCCGCGG GCGCGGCTTC
GACGACGCGC TGCTCGTCAC CGGGGAGGGC CTGGTCTGCG AGACCACCGT CGCCAACATC
GGTTTCGTCA CCGCGGCGCC GGCCAGGGTG ATCTGGCCGG ACGGCCCGTC GCTGGTCGGC
GTCACCATGG CGCTGCTCGA CGAGCGGCTG CGCCCCGGGG CCACCGCCGA GCCCGTCGCA
CATCAAGTCC CCGCCGCGGC GACCGCGGCC GCGGTGACGG CGCCCGCCAC CGCCACCATC
ACGCCCGCCG CCGCGGCCGG AGAGGAGCGC GGCCCGCTCC TCGAGTCCCG GCGGGAGCCG
GTACGGCTGG CGGACGTCGG CCACTTCCGG GCCGCCTTCG TCGCCAACGC GCGGGGGATC
GTGCCCGTCG ACCGGATCGA CACGACACCG GTCCCGGTGG ACGAGGCGGT GCTGGCCGCG
CTGCGCCGCG CCTACGCCTC GGTGCCCTGG GACGAGATCT GA
 
Protein sequence
MGSVDTTPLT HVEVDGHPAG VDDLRHLALA NDGHLTTMQV RAGRVRGLAL HLRRLDRANQ 
ELYGTYLDGG LVRDRIRHAL ATTPAPHGPA ESGDATVRTV VFPTGTGSVS ILVSIGPPAE
PAAGPLRLCS LRYQRPFPHI KHVGSFGQIH FGRLARGRGF DDALLVTGEG LVCETTVANI
GFVTAAPARV IWPDGPSLVG VTMALLDERL RPGATAEPVA HQVPAAATAA AVTAPATATI
TPAAAAGEER GPLLESRREP VRLADVGHFR AAFVANARGI VPVDRIDTTP VPVDEAVLAA
LRRAYASVPW DEI
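
The gene sequence begins with GTG while the protein begins with M, which is what translation table 11 (the bacterial, archaeal and plant plastid code) allows for an alternative start codon. The following is a minimal sketch of how the two sequences relate, not the method used to produce this record. It assumes the input is the coding strand in frame, uses the standard codon assignments (which table 11 shares), and treats only GTG and TTG as alternative initiators, which is a deliberate simplification of the full table 11 start set. The names `clean`, `gc_content`, `translate` and `ALT_STARTS` are hypothetical helpers introduced for illustration.

```python
# Standard codon assignments listed in TCAG order; translation table 11
# uses the same codon-to-amino-acid mapping as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Simplification: only two of the alternative initiators are handled here.
ALT_STARTS = {"GTG", "TTG"}


def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Fraction of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        amino_acid = CODON_TABLE[codon]
        # An alternative start codon in the first position is read as Met.
        if pos == 0 and codon in ALT_STARTS:
            amino_acid = "M"
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence shown above.
print(translate("GTGGGTTCCG TGGACACGAC ACCGCTGACG"))  # MGSVDTTPLT
```

Passing the full gene sequence to `translate` should give a string to compare against the protein sequence above, and `gc_content` on the same input can be compared against the 76% GC content field.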