Gene Franean1_1952 details


Gene Information

Locus tag: Franean1_1952
Symbol:
ID: 5670353
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 2347551
End bp: 2348879
Gene Length: 1329 bp
Protein Length: 442 aa
Translation table: 11
GC content: 74%
IMG OID: 641240873
Product: hypothetical protein
Protein accession: YP_001506295
Protein GI: 158313787
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component
TIGRFAM ID:
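
The Start bp, End bp, Gene Length and Protein Length fields are related by simple arithmetic. The short Python sketch below checks that relationship. It assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that adds no amino acid. The record does not state either convention, but both are consistent with the listed lengths. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2347551
end_bp = 2348879

# Assumption: coordinates are 1-based and inclusive,
# so the gene spans end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# A CDS of N bases holds N / 3 codons. Assumption: the last codon
# is a stop codon and contributes no amino acid.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1329 0 442
```

A remainder of 0 indicates that the gene length is a whole number of codons, as expected for an unspliced CDS.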


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.780394
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.542159
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGACCC GACCGACTCC CGCCGGGCGC GAGCGAGAGG CCGGCCGGGT CGACCGGACC 
GGCCCGCCCC AGCGGGTCCC GGTCCGCCGG GCGCCCCGGC TCCGCCCACT GCGGCGGATC
GCCGCCGTCG CCTCCGGGCT GGGCGTGGCC GGGTCGATCC TGCTCACCGG GTGCGTCGCC
AGTAGTACCG ACGAGACGTC CGGAGCCCGC GCCACGTGCG CGCCGACGCC GGGTGTCACC
CCCGACGAGG TCCGCTTCGG GGCCCTCTAC CCGGACACCG GCTCCGGTTC GCCGCTGTCC
CGCGCGTTCC GGGCCGGCAT CGACGCCAGG CTCGGGGTCG TCAACGGTGC CGGCGGCATC
CAGGGGCGGC AGGTGCGGTA CGACTGGCGC GACGACGAGT CGACGTCCGA CGGCGCACTG
CGCGGCGCCC GCCTGCTCGT CGACCGTGAC CAGGTCTTCG CGATCGTCGG CACCAGCGGC
ATCGCGACCG AGGCGGTGAG CTACCTCGCC GAGCGCGGCG TGCCGACCAT CGGCCAGGAT
CTGACCGCCA GCGGCGACAA CGCCTTCGGC TACTCCAACG TTCTCGGCGG CCAGCTCGGG
AACTCGGTCT TCGGAGTGTT CGCCCGTGCG CACGGCGCCA CCCGGGCCGT CCTGCTGCGG
ACCGAGCAGA TCCCCGCCTC GGGGCAGATC GACGAGCGGA TCGCCCACAG CCTGCGCGCA
GGCTCGGTCG AGGTCGTCGA CACCATCGAC TGGACGCCGA CCGGCTTCGA CCTGAACGCC
GTCGCGGCCC GGGTCCGCGC GGCCAACGCC GACATGATCA CCGGAGTGGT GCCCCCGCAG
GCGCTCGCGG ACGTCGCGAC CGCCGCCCGG CAGGCCGGGG CGACCATCAA GGTCGTGATG
GTGCCGATCG GGTACGACCC CGGCCTGCTC GATCGCCGTC CACAGGGCCT GGCCGGCAGC
TTCTTCCTCG TCGACTTCGT GCCCTTCGAG GCGGGGACCC CGGCGCACGA GCGCTACCTC
GACGCGATGT CCCGCTACGC CCCCCAGATC GAGCGCCGGG AGCAGACCAC CGGTCTGGTC
GGCTGGATGA CCGCCGACCT CTTCCTCCGG GGCCTCGTCG AGGCCGGTCC GTGCCCCACC
CGCGCGGGCT ACATCGCCGC GCTGCGCAAG GTGGCGGACT ACGACGCGGA CGGGCTGCTG
CCCAGGCCTG TGGACCTCGC GGTGAAGTCC GCGCCCACCG CCTGCGTGAG CGTGGTCCGA
GTGTCCCCGG CCGCGGACGC TTTCCAGGTC CAGATGCCCA TGGCCCTCTG CGGCGAGGCG
CTGAGCTGA
 
Protein sequence
MTTRPTPAGR EREAGRVDRT GPPQRVPVRR APRLRPLRRI AAVASGLGVA GSILLTGCVA 
SSTDETSGAR ATCAPTPGVT PDEVRFGALY PDTGSGSPLS RAFRAGIDAR LGVVNGAGGI
QGRQVRYDWR DDESTSDGAL RGARLLVDRD QVFAIVGTSG IATEAVSYLA ERGVPTIGQD
LTASGDNAFG YSNVLGGQLG NSVFGVFARA HGATRAVLLR TEQIPASGQI DERIAHSLRA
GSVEVVDTID WTPTGFDLNA VAARVRAANA DMITGVVPPQ ALADVATAAR QAGATIKVVM
VPIGYDPGLL DRRPQGLAGS FFLVDFVPFE AGTPAHERYL DAMSRYAPQI ERREQTTGLV
GWMTADLFLR GLVEAGPCPT RAGYIAALRK VADYDADGLL PRPVDLAVKS APTACVSVVR
VSPAADAFQV QMPMALCGEA LS
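
The sequences above are laid out in space-separated blocks of ten characters, so they need light cleaning before any computation. The following is a minimal Python sketch, not part of the original record, that strips the block spacing, computes a GC fraction, and translates a nucleotide sequence. The helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical. The codon table is the standard amino-acid assignment, which translation table 11 shares for elongation codons. The sketch does not handle the alternative start codons of table 11, which is not an issue here because the gene starts with ATG.

```python
from itertools import product

# Standard codon assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text):
    """Remove block spacing and line breaks, and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Return the fraction of G and C bases (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = clean_sequence("ATGACGACCC GACCGACTCC CGCCGGGCGC")
print(translate(first_blocks))  # MTTRPTPAGR
```

The ten residues printed match the first block of the protein sequence. Passing the full gene sequence through `clean_sequence` and then `gc_fraction` or `translate` would allow the listed GC content and protein sequence to be checked in the same way.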