Gene Franean1_6957 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6957 
Symbol 
ID5675270 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp8479090 
End bp8481048 
Gene Length1959 bp 
Protein Length652 aa 
Translation table11 
GC content68% 
IMG OID641245806 
Producthypothetical protein 
Protein accessionYP_001511197 
Protein GI158318689 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.304536 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCCAGG AACGAGAACC GGAAAGTCCG GTGGGAGTTC ATGATCTGAT ACGTCATCAC 
TGGATTTCGA GTAGGGATTT CGACGCGCTC GCGTCCGGAA CCGATCAACA CGGGCTGATC
TGCCAGCTTC GGGTTGCCGA ACGTAGTTAC CGATATCTCT CGCTCCGTAG TATCCTTGAT
TTCGCCCGCG GGCATGAGCC GACAACCGGG CTGCTGTCGT CTCCTGACAC GGCATGGGAC
CTTCTCGTGG AGGCCGAGCG CGTCGCTCCG GCCATGGTCA CAGCCATCCT GGACCTACCG
AGCGTCGGGG CCTGGGTGGC CCGGGCGCTG CGGCGGACCC GTGGGCTTCT GTACGACGAG
ATTCCGCTGT GGGTCGACCT CGGCTATCTC CATCTGCTCT CGGCGGCCGC CGGTATCCGA
TGCGGCATCC CTTTTAGGCT GGATGTTCCG CTACGCCACG GTCAGCTTCA CCTTCCGACG
CTCGGCTCCA TCGTGCTGCC GGGCAAGGAG ATCTGGGGTT CGACGACCGT CATCTCCGAC
GGGCGATCGG CGCATGCCCT CCTGCCTGTG GGCAAGATCC GTCTCGCGGG GCGCGACCCG
GTGGACGATC CACCCGGCTG GAGGAGAACG ACGAGCCTGG ATGCGCGGCA CCGCGGGGCG
GCGGCCACCG TCTACCTTGA CAGCAGCGAC CCCTATCGCA TGGTCGAAAC GCCCGCGCTG
CCCGAGAACA TCGGCATACC CGTCCAACGG CACTGGGAGT CCCTGTTCCA GGAAGCCTGG
GCGGAACTGG TGGAACAGGA CAGCGAGGTT GCCCGATGCG TGGCCGAGTG CACCCTCACG
CTGGTCCCGC TGCCCCGGGC AGAGCGGTTC AGGGAAAGAA GCGCATCGTT CGGCGACTCG
TTCGGCGGGG TTATTCTATC GCTGCCGGAT AGCCCGGAGC GGTTCGCTGT GACCCTGGTA
CACGAAATGC AGCACGCAAA ACTCGGCATC CTACTCCATC TATTCTCGTT CTTGCGCGGG
GAAGGAAGCA TGCTTGCCTA TGCGCCGTGG CGTGACGATC CCCGACCCCT GCAGGGCCTG
CTGCAGGGGA TCTACGCCTT TTTCGGCGTC GCCGGATTCT GGCGTCGTCG CTTCGCCATG
GCGAAGGGAG AGGAAGCGGC TCTCGCCGGC TTCGAGTTCG CCCTGTGGCG CGGCAAGGTG
AGCGAGGCCA CCGCACACGC CAGGAGTCGT CAGGAGTTCA CGGCCCTCGG CCATCGCTTT
CTGACCGGGA TAGCCACCAC AGTCACGGCG TGGTCGGCCG AACCGGTGTC GCCGTACTAC
GCAGGGCTGG CGAATCTCGC CGCGGCGGAC CATCGTGCGG GCTGGCGCGT CCACCACCTC
ATCCCTCCGG CCGCCGACGT CGCGAGCCTG TCCCGGGCCT GGATCGCCGG GACGGATCCG
CGTGAGCTTC GTGTCCCCGG CCCGTCCGCG CTCGTCCCGG ATGGGAAGGT GCCCGACCTG
GACACCCTGG TCGTCCTGGT CCGGTACTGG TTGGCCGACC GGGAGCTCTT CCGCCAGATC
GAACGGAATG GCCAGGTCGG CAGCGTGGTC ACCGGCGCGA CGGCGGCGGA TCTCCACCTG
GTCGCCGGAC GCCACGACGA GGCCGCCCGT GCCTATCTCG ACGAACTCGG GGAGCCCTCG
CCCGGACTGA CTGCCTGGAC GAGGCTCGGA TCAGCGCTCG CCACGAGCCC CGACCACCGG
ATGGCGGCGG CCGGCCGTGC CCTGCTCACC CGGCCCGAGC TCGTCCGGGC GGTGGCCCGT
GCCGTCGAGG CGGACACCGG CCGCAGACCA GCGCCGGTCG ACCTCGCCGG ATGGCTGGGT
GAGCTTCCCC CGAACGAGGA CCGGGTCCGC GGCACGGGGG CGGACGACGT GTCCGCCCCC
GCCCCGAGGC CCGGCGACTC GCAGGAAGCA GTGCACTGA
 
Protein sequence
MRQEREPESP VGVHDLIRHH WISSRDFDAL ASGTDQHGLI CQLRVAERSY RYLSLRSILD 
FARGHEPTTG LLSSPDTAWD LLVEAERVAP AMVTAILDLP SVGAWVARAL RRTRGLLYDE
IPLWVDLGYL HLLSAAAGIR CGIPFRLDVP LRHGQLHLPT LGSIVLPGKE IWGSTTVISD
GRSAHALLPV GKIRLAGRDP VDDPPGWRRT TSLDARHRGA AATVYLDSSD PYRMVETPAL
PENIGIPVQR HWESLFQEAW AELVEQDSEV ARCVAECTLT LVPLPRAERF RERSASFGDS
FGGVILSLPD SPERFAVTLV HEMQHAKLGI LLHLFSFLRG EGSMLAYAPW RDDPRPLQGL
LQGIYAFFGV AGFWRRRFAM AKGEEAALAG FEFALWRGKV SEATAHARSR QEFTALGHRF
LTGIATTVTA WSAEPVSPYY AGLANLAAAD HRAGWRVHHL IPPAADVASL SRAWIAGTDP
RELRVPGPSA LVPDGKVPDL DTLVVLVRYW LADRELFRQI ERNGQVGSVV TGATAADLHL
VAGRHDEAAR AYLDELGEPS PGLTAWTRLG SALATSPDHR MAAAGRALLT RPELVRAVAR
AVEADTGRRP APVDLAGWLG ELPPNEDRVR GTGADDVSAP APRPGDSQEA VH