Gene Franean1_4807 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4807 
Symbol 
ID5673148 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5739241 
End bp5740593 
Gene Length1353 bp 
Protein Length450 aa 
Translation table11 
GC content76% 
IMG OID641243663 
Producthypothetical protein 
Protein accessionYP_001509079 
Protein GI158316571 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.280649 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.472222 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGATCGCT CGGCGCCGGC ACCGCGGGCG GCCCGACCGA GGCGACCGGG ACCTTCGACC 
ACCTCCTCGT GCGGGGCGAG GCTCGCTCGG CCGGCGCGGC CGGCCAGGTG GCGCGGCCGG
CTGCTCGCCG CGAAGGCCGT CGTGGTCGGC GCCGTCGCGT TCGTGACCGG CCTGGTCGCC
GCCGCCGTCG CCGTCGTCTT CGGCCAGCGC GTGCTGCGCG GCAACGGCGT CTACGTCCAC
CCCGCGACGA CGCCGACCGA GCTGCGCGTG ATCGTCGGGA CCGCCGCGCT GCTCGCCGTC
GCCGCGGTCC TGGCGCTCGG GCTGGGGACG TTGCTGCGGC GCGGTGTCAC CGCGGTGGCG
ATCGCCGTCG CCGTGATCGT CCTGCCGTAT CTGCTGGCCA TGACCGTCCT GCCGGACGGG
GCCGCGGTGG CTGCTGCGGG TGAGCCCGGC GGCGGCGTTC GCGCTGCAGC AGACGGCGAC
GCAGTACCCG CAGGTCGCCA ACCTCTACAC GCCGGCGAAC GGGTACTTCC CCCTCGCCCC
GTGGGCCGGC TTCGGGGTGC TCGCCGGGTG GGCCGCCCTC GCCCTGGGCA CGGCCGCCGT
CCTTCTCCGG CGGAGGAGCG CGTGAGATCG GCCCTGCACG CCGAGTGGAC CAAGCTGCGG
ACCTCGCCCG GCACGCTCGG GCTGGCGCTC GCCGTGATCG TGAGCACGGT CGGGTCGAGC
GCCGCGGTGG CCGCGGCGAC CGGGTGCGCG CCAGGAGGCT GTGGGCAGGA CCTGACGAGA
CTGAGCCTCA CCGGGGTCCA GGTCGGTCAG GCCGTCGTCG CCGTCCTCGC GGTCCTGGTG
ATCGGCGACG AGTACAGCAC CGGGATGGTC CGGGTCACGC TCACCGCGCT GCCCCTGCGG
ACGACTGTCC TGGCCGCCAA GGCCGTCGTC GTCGCCGGGG TCGTCGCGGT GACGGCCGTG
CCCGCCGTCC TCGGGTCCCT GACCGTCGGG TGGTTCATCC TTCCCGAGCA GGAGGTCGTC
CCCCGGGCGG CCGTCGGTTC CGTGCTGTAC CTCGTCCTCA TCGGCCTGTT GGGCCTGGGA
ACGGCCACCG CCGCGCGGAA CCCGGCGGCT GCGTCCGGGA TCGTCCTGGG ACTGCTGTAC
GTGTTCCCGA TCATCGCCCA GGTGGTCACC GACCCGGGCT GGCGGCGGCA CCTGCAGCAG
GCCGGGCCGA TGAGCGCCGG GCTCGCCGTC CAGGCCACCG GCGACGTCGA CGCCGTGCCG
ATCGGACCGT GGCAGGGACT CGGCGTGCTC ACGCTGTGGG CGCTGGCCGC GCTCCTCACC
GGCGGCCTGC TCCTCGCCCG GCATGACGCC TGA
 
Protein sequence
MDRSAPAPRA ARPRRPGPST TSSCGARLAR PARPARWRGR LLAAKAVVVG AVAFVTGLVA 
AAVAVVFGQR VLRGNGVYVH PATTPTELRV IVGTAALLAV AAVLALGLGT LLRRGVTAVA
IAVAVIVLPY LLAMTVLPDG AAVAAAGEPG GGVRAAADGD AVPAGRQPLH AGERVLPPRP
VGRLRGARRV GRPRPGHGRR PSPAEERVRS ALHAEWTKLR TSPGTLGLAL AVIVSTVGSS
AAVAAATGCA PGGCGQDLTR LSLTGVQVGQ AVVAVLAVLV IGDEYSTGMV RVTLTALPLR
TTVLAAKAVV VAGVVAVTAV PAVLGSLTVG WFILPEQEVV PRAAVGSVLY LVLIGLLGLG
TATAARNPAA ASGIVLGLLY VFPIIAQVVT DPGWRRHLQQ AGPMSAGLAV QATGDVDAVP
IGPWQGLGVL TLWALAALLT GGLLLARHDA