Gene Franean1_6555 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6555 
Symbol 
ID5674870 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp7972968 
End bp7974209 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content69% 
IMG OID641245404 
Productlipopolysaccharide biosynthesis protein 
Protein accessionYP_001510798 
Protein GI158318290 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.381566 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAGAGC CTTTCGATCC TGGCCTGGTA AAAGGCTCAT CCGTGGCGCT CGCGGCTTAC 
CAGGGGGAGT CGGAAATGGA CGTAGTGACG GTTTGTCGAA CGGTTTTTCG CCGGTGGTAC
GTGGTTGTTC CGGTCCTGGT TGCCACCGTG GTCGTGCTCT GGGTCACAAC GTCCAAGGCA
GAACCGGTCT ACCAGGCCGA GGCCCAGGCC ATCGTCGTCG GACCGTCGGT GGAACAGAAC
GGTGAAATCG TACAGCCAGT CAACCCGCTG TCCTATTTCA ACGACTCTCT CAAACTGCTC
ACACTGACCA TATCGAGGAT CATCAACGGC GAGCAGGCGC GGGACGGGAT CAGGGCGGCG
GGCTTCCGCG CGGATTACAC GGTGACGGCG CCACCGGAAA CGACATTCCT AGAGATCACC
GCGACCGATC CCGATCCCGC CGAGGCCACT CGCACAGCGG CGCAGGTACT GGACCAGATC
ACCGCCAACA CGAACGAACT ACAGATGTCG GTGCCGGCGG GTCTGCGTTA TGAGGTTCAG
CGACTGTTCC AACCGGTGCG CACGAGCAAC GAGCCTGGGC GCAGCCTCCA GTTGCTGGCG
ACCATCGCGG TCCTGGGAGC GCTGGCCGCG TTGGGGCTGG CACTGTCCGT CGACGCCGCG
GCCAGGCGAC GGGCCCGATC GCGCCGGGAG CGACCCCGGT CACACCGCGG CGCCTGGTCG
CGCCGGGATC CAAAGCGGAC CGGCGGACGT CGACACCTGG CCGGGTCGCC GAAGGCGGTG
TGGCCTGAGC CGGCGGTGTG GCCTGAGCCG GCGGTGCAGT CTGAGCCGGC GGTGCAGTCT
GAGCCGGCGG TGCAGTCTGA GCCGGCGGTG TGGCCTGAGC CGGCGGTGCA GTCTGAGCCG
GCGGTGCAGT CTGAGCCGGC GGTGCAGTCT GAGCCGGCGG TGCAGTCTGA GCCGGCGGTG
CAGTCTGAGC CGGCAAAGGC ACCGCCGGTA CCCGCCCAGG CCGCACCGGT GCCGGCCGGG
CAGCCCGCGG CCACGCCGAT GCCGTCCAAG GCCCCGAAGA CGGCCGCGCC GTCGGTGCCC
GCCCAGATGG CACGGGAAGA AGCCGGTGAG CGGTCCTGGT CCCGCGAAGG GGCCGGCGGA
TGGTCCTTCG CCGTAGCGTC CTTCGACGGA GCGTCCGACG ACGGAACTGG CGGGTGGTCC
TTCGACGGAG TCGGCAAGAG GCGGCCGGGG GCAGGCCTGT GA
 
Protein sequence
MTEPFDPGLV KGSSVALAAY QGESEMDVVT VCRTVFRRWY VVVPVLVATV VVLWVTTSKA 
EPVYQAEAQA IVVGPSVEQN GEIVQPVNPL SYFNDSLKLL TLTISRIING EQARDGIRAA
GFRADYTVTA PPETTFLEIT ATDPDPAEAT RTAAQVLDQI TANTNELQMS VPAGLRYEVQ
RLFQPVRTSN EPGRSLQLLA TIAVLGALAA LGLALSVDAA ARRRARSRRE RPRSHRGAWS
RRDPKRTGGR RHLAGSPKAV WPEPAVWPEP AVQSEPAVQS EPAVQSEPAV WPEPAVQSEP
AVQSEPAVQS EPAVQSEPAV QSEPAKAPPV PAQAAPVPAG QPAATPMPSK APKTAAPSVP
AQMAREEAGE RSWSREGAGG WSFAVASFDG ASDDGTGGWS FDGVGKRRPG AGL