Gene Franean1_2185 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2185 
Symbol 
ID5670585 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2619002 
End bp2620087 
Gene Length1086 bp 
Protein Length361 aa 
Translation table11 
GC content69% 
IMG OID641241106 
Productpentapeptide repeat-containing protein 
Protein accessionYP_001506527 
Protein GI158314019 
COG category[S] Function unknown 
COG ID[COG1357] Uncharacterized low-complexity proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.225539 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCCCATGA CTGACAACGG GGGCACTCGT CGGCCAGCAG GTCGGCGGTC GCTGTGGATC 
GTTGCCGGGC TCGCCGCGGT CGGCGCGGTT GCCGCCGTGG TGGGGATCTG GCACCTGCCG
GACCGGATGT ACCCGCCGGG CACCGACGGT GAGGCGGAGG CGCGTGCGGC ACTGCAGGGC
GGTCTGTTGA CGGCGGCCGC CGCGCTGACG GCGGTGGCCG GTGCCCTGAT CGCACTGGAC
GAGACCCGGC AGGCCAACGC TGAGGTGCGT CGCGCGAACG CAGAGGTCCG CCGGGCGAAC
GAGAACACGC ATGTCCGGGA GTTGTATGCG ACGGCAATCG GGCTGCTGGG GGCGGACACG
ATCGACAGCC GCCTTGGTGG GATCTATGCC CTGGAACGGG TCGCTGTCGA CTCACCAGCC
GATCAGCGCA CCGTGGTGGA GGTCCTCTCG GCGTTCGTTC GAGTCCACAG CACCGACCCT
GCCCTACGCC CTGCTGTCCC TGACCCAGCT TCTCCTGTAC GCCCGGCGGT GGACGTGCAC
GCTGCTGTCA CCGTGCTAGC CCGTCTCCCC GTGATCCCCG ACATCCCACG TGCAGACCTG
AACGGGGCAA AACTCACCGG TCCGGCCGCC CTCGACCGTC TCCAAGCCGC CCGCGGCAAC
CTCGCCCAGG TCGAGCTTGC CGAGGCAGAC CTCCGCGGCG CCCGCTTGGA CGAAGCAGAT
CTTGCCGACA TCAAGATGGT CGAGGTTGAC TTCACCGGCG CGCAGATGGT CGGGGCGAAC
CTCGCCGGAG CACAGATGGT GGAGGCGAAC TTCGCTTGGG CCGAGCTGAC GAGAGCGGAC
TTCAGTGGGG CGCAGCTGGT GCAAGCGGAC TTCACGGAGG CGCAGATGGT CGGGGTGAAC
TTCACGGGGG CGCAGCTGGT GCAAGCGGAC TTCACGGGGG CGCGCTTGAA CGGTGCGAAC
CTGATGAACG CTGAGGGGGT GTCGCAGGAG CAGGTGGACG TCGCCTTCGG GGACAGCGAG
ACTCGCCTGC CGCCGGGGCT GACGCTTCCG GCGTCGTGGA CGGCTGGCGG TGCCAGTGGG
TCATGA
 
Protein sequence
MPMTDNGGTR RPAGRRSLWI VAGLAAVGAV AAVVGIWHLP DRMYPPGTDG EAEARAALQG 
GLLTAAAALT AVAGALIALD ETRQANAEVR RANAEVRRAN ENTHVRELYA TAIGLLGADT
IDSRLGGIYA LERVAVDSPA DQRTVVEVLS AFVRVHSTDP ALRPAVPDPA SPVRPAVDVH
AAVTVLARLP VIPDIPRADL NGAKLTGPAA LDRLQAARGN LAQVELAEAD LRGARLDEAD
LADIKMVEVD FTGAQMVGAN LAGAQMVEAN FAWAELTRAD FSGAQLVQAD FTEAQMVGVN
FTGAQLVQAD FTGARLNGAN LMNAEGVSQE QVDVAFGDSE TRLPPGLTLP ASWTAGGASG
S