Gene Franean1_7002 details


Gene Information

Locus tag: Franean1_7002
Symbol:
ID: 5675313
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 8534207
End bp: 8535751
Gene Length: 1545 bp
Protein Length: 514 aa
Translation table: 11
GC content: 69%
IMG OID: 641245848
Product: hypothetical protein
Protein accession: YP_001511239
Protein GI: 158318731
COG category:
COG ID:
TIGRFAM ID:
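
The start, end, gene length, and protein length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function names are hypothetical and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length = gene_length_bp(8534207, 8535751)
print(length)                     # 1545
print(protein_length_aa(length))  # 514
```

Both results agree with the Gene Length and Protein Length fields in this record.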


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.618001
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGCTCAC TGATCCTGCT GATCATCGCA ATCTGGCTGG CCTTCTTTTT CTTCACCGTG 
CCCGGGCCGT TCATCGCCGG CGCCGCGGCC CTGTACGTCG CCGGGGCACT GGTCGTCGCC
TATTTCCAGG AATTCGGCCG GGCGATGGGG CTGGGTCCGA ACCCCGCCGC CGCCGCACCG
GAACAACCGC CGCCGCGGCG CACCGAGGAC GGGAAGGAAC CGGCCTACCG GCAGTACCTG
TTCGGACAGG CCCGTCACGA TCTACGGCAT GCGCACGGTC TCGTCTGGCC GCGGCTGGCC
GCGCTCGGCC GGAATCACCG GCGGTGGGTG AAGAACACCT TCTTCGGCTA CGGGGCGATG
GACTGGCACT GGCCGGTGGG CGTCGTCCTG ATGGTCGGCC TGTTCGCCGG GACGATCCTG
GGCCTTGCCG TCATCTCGCT GGTGGCGACG GCCCAGGGCG TGGTGCTGCT GGCCGTCTTT
CTGCTGGCGT TCCTCGGGAT CTACCTGCTG CGCGGCATCG ACACCGTCCT GCTGTGGATC
CGCGGGGTGC GCATCACCTG CCCGTCTTGC TACCGGCGCG GCTTCTACCC GTCCTACGAG
TGCCGGAACT GCACCGTCCG CCACCACGAC GTGCGGCCGG GCAGGTACGG CGTCGTCCGG
CGCGTCTGCG CGTGCGGGGA GCGCCTGCCG ACCCTGCTGC TGTTGGGCAG CCACCGGATG
AACGCCTTCT GCGCGCACTG CGAGGCTCCA CTGGCCGAGA GCGTGGGCAC CGCCGCTGAG
GTGGTGCTCC CGGTGTTCGG CGCGGCCGGC GCCGGCAAGA CCCGGCTGAT GATCGTCATC
ATGATGGCGG TCGAGGCGAT CGCCGGACGC AGCGGCGCCA CCCTCGCCCT GGCCGACGAG
GACACCCGGA AATGGGACGC CCAGGCCCGC CGCGAGCTCA TCAGGTCGGA CAAGGTCGCG
AAGACCGGAA TCCGGCTTCC CCGCGCCTAC TCGCTTTACG TCGAGCCGCG GCGGGGCGGC
CGGCGGCTCG TCCACGTCTT CGACCCGGCC GGCGAGTACT TCAACGAATC CGACCGCCTG
CAGGAGCTGC AGTTCCTCAC CCTCGCGCGC ACCTTCCTGT TCGTCGTCGA CCCCCTGTCG
GTAGACGCGC TGTGGGCCCG GCTCGACCGC GCCGACCAGA ACCGGTACAG CGGCGTCCGG
GCCAGGCGCG AACCCGAGTT CGTCTTCGCG CAGACGGTCC AGAACCTCGA GGCGATGGGC
GTGCGGACGA AGAAGGCCCG GCTGGTCGTG GTCGTGAGCA AACGCGACCT CGTCAGTCGG
ATCCTGATCG AGGACGGCGT GGAGGACGGC GAGGAGGCCC TCGTGCGCTG GCTCGACGAG
AACCTGCACC AGGGAAACAT GCTGCGGTCC ATGCGGCACG CGTTCGGCGA GGTTCAGTTC
TTTCTCACCA CCTCCATCAT TTCTGACGAC AGCCGGGTCG ACGACAGCAT CGAGAAACTG
ACGTCCTGGA CGCTCGCGCG GCAGGGCCTG CGGCTTTCGG GGTGA
 
Protein sequence
MGSLILLIIA IWLAFFFFTV PGPFIAGAAA LYVAGALVVA YFQEFGRAMG LGPNPAAAAP 
EQPPPRRTED GKEPAYRQYL FGQARHDLRH AHGLVWPRLA ALGRNHRRWV KNTFFGYGAM
DWHWPVGVVL MVGLFAGTIL GLAVISLVAT AQGVVLLAVF LLAFLGIYLL RGIDTVLLWI
RGVRITCPSC YRRGFYPSYE CRNCTVRHHD VRPGRYGVVR RVCACGERLP TLLLLGSHRM
NAFCAHCEAP LAESVGTAAE VVLPVFGAAG AGKTRLMIVI MMAVEAIAGR SGATLALADE
DTRKWDAQAR RELIRSDKVA KTGIRLPRAY SLYVEPRRGG RRLVHVFDPA GEYFNESDRL
QELQFLTLAR TFLFVVDPLS VDALWARLDR ADQNRYSGVR ARREPEFVFA QTVQNLEAMG
VRTKKARLVV VVSKRDLVSR ILIEDGVEDG EEALVRWLDE NLHQGNMLRS MRHAFGEVQF
FLTTSIISDD SRVDDSIEKL TSWTLARQGL RLSG
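
The sequences above are printed in blocks of ten characters separated by spaces, so they need to be joined before any length or composition check. The following is a minimal sketch of that cleanup together with a simple GC percentage calculation. The helper names are hypothetical, and the example uses only the first line of the gene sequence rather than the full record.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied as displayed (six blocks of ten).
first_line = "ATGGGCTCAC TGATCCTGCT GATCATCGCA ATCTGGCTGG CCTTCTTTTT CTTCACCGTG"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Applying the same two functions to the full gene sequence would allow a comparison with the Gene Length and GC content fields.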