Gene Franean1_1339 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1339 
Symbol 
ID5669750 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1611132 
End bp1612844 
Gene Length1713 bp 
Protein Length570 aa 
Translation table11 
GC content66% 
IMG OID641240270 
Producthypothetical protein 
Protein accessionYP_001505697 
Protein GI158313189 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGGCC CGCGTCGCCA GCGGGCGCAC GCCCGCCATC GAGCGGGGCA TCGACCCTGG 
ACTTGTCCCG GCCGTCGCCC GGGGTGTCGC GAGCACGGCC GCAGCGGTCC GGCACCAGTT
AGCTACCGTG ACGCACGCCG CGGTGTACGG CTTACGGTCG GCGTGCTGGC GGTGGTGAGC
ATTCTCGGAT TTGCGGTGTG GACGTTCGTC GACCGTTCGT CTTCGCCGGA GGCCCTTGCT
GCTTCGGTTG TCGCCTCTAC GGCGGATGAT CCGGATTGCC GCACGGCCAC CGCGACCTTC
ACGGACGTTT CGGTGCCCGG GTATGGCGGC CCGGCGCCCG CAGGCTGGCA GCCAGCTCAG
CGCTGCCCAT CCGATAGTCG GCTGGGGGCG CGATCGCCGG AGAGCGGTTT CGCAGCCCAG
CCGGCGCAGA TTCCGGGAAT CGGCGGGATT GTTGGTGGTC TGGTGGGCGG AGGCATCACG
GGGGTTATGG AAACGGCGGT CGAAACCGTA GTGAACCGGG TGTCGCAGAA TCTGGTTGAC
GCGGCCAAAA GTATGATCCT CGAGTTCCTC GGGGCATCGA CCCGGCCGCA GGTGACAGCA
GAGGAGTTTA TCGGCCCTCA CGGTGCGTAC CACAGCACGG CATCGATGGC CACTCTGCTG
CTCGTCGGAT GCGTGATGAT CGGTGTTGGG CAGGGACTCT GGTCTGGCGA GCCGGTCCAG
GCCATGCTGC GACTTCTTGG CGATATACCC GTCGCGGTAC TGGCCATACT GGGTTTTCCG
TGGGTCGTGG ATCAGCTGGT CACGATTTCC GATGTGATGG CCGACTGGGT GCTCGGGAAC
GACCTTCGTA CCAGGAATGA GATCCTCGAT CTGGTCGTGC CTTTCTCTGG CGGCCCAGAC
GGTAACGTCG GGTATCTGAT CCCCCGGATC TTCGTGTATC TCGGGGTCGC CTTGATATAC
CTGGAACTCG TCGTGCGGAA CGGTCTGATC TACATGGTCG TCGCGCTTGC CCCGCTGTCA
TTCATGGCGA TAACGATGTC AGGGGCTAAA TTGGCGGCGC GGAAGGCTGT CGAAATGGTT
GTCGCCATAA TTTTGATCAA GCCGGCGGTA TTCGTCGAGC TACGGGTAGG GCTCGACCTC
GCTCATCCGG GTCTCGGTAG CCCGGCGGCC GATGGTGATG CGTGGGGAGA AATCTTTGTC
GGCATGGCGA TCGTGTTTAT CGCCGCGTTC ATGCCATGGA TCATCTGGCG CCTCATGCCC
CTGATGGAAC ATGCGATGGT CGCACAAGGA GTTGCCCGAG CGCCGTTCCG CGCTGGGATG
CAGGCCATGC AGATGGTGTA CTTCGGGTCC GCGCTGGCCG GGCGCGGCGC CCGCGGTGGA
GCAGGCGGTG GACGCGGCAG GGTGTTCGGG CAGCAGCCAG CCGGCGCTGG AGGCGGAGGC
GGGTTCGGCC CGCCTCGGAG TCTGACCGGT ACGGGTGCCT CCGCGAACGG CCCAACCCGC
CCGATGACAC GTGACAGCTC TGGCGCGGGC AGTGAACGGC GCGGAACTAG GCGGGCAGAG
ACACCTTCTT CGTCGAAGCC GCCGTCTGGT CCGGACCTGG GGGAGCGTTC GCTGGGTGAT
CCGCGGTCTG GCGGGCGCGT TCGGCGGGGC GAGCCTCCGC CTGCTCCGCC CGGTGACCGG
CGAGGACCCG AAAGTCCGCG GGGCCGGTCA TGA
 
Protein sequence
MTGPRRQRAH ARHRAGHRPW TCPGRRPGCR EHGRSGPAPV SYRDARRGVR LTVGVLAVVS 
ILGFAVWTFV DRSSSPEALA ASVVASTADD PDCRTATATF TDVSVPGYGG PAPAGWQPAQ
RCPSDSRLGA RSPESGFAAQ PAQIPGIGGI VGGLVGGGIT GVMETAVETV VNRVSQNLVD
AAKSMILEFL GASTRPQVTA EEFIGPHGAY HSTASMATLL LVGCVMIGVG QGLWSGEPVQ
AMLRLLGDIP VAVLAILGFP WVVDQLVTIS DVMADWVLGN DLRTRNEILD LVVPFSGGPD
GNVGYLIPRI FVYLGVALIY LELVVRNGLI YMVVALAPLS FMAITMSGAK LAARKAVEMV
VAIILIKPAV FVELRVGLDL AHPGLGSPAA DGDAWGEIFV GMAIVFIAAF MPWIIWRLMP
LMEHAMVAQG VARAPFRAGM QAMQMVYFGS ALAGRGARGG AGGGRGRVFG QQPAGAGGGG
GFGPPRSLTG TGASANGPTR PMTRDSSGAG SERRGTRRAE TPSSSKPPSG PDLGERSLGD
PRSGGRVRRG EPPPAPPGDR RGPESPRGRS