Gene Franean1_3801 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3801 
Symbol 
ID5672165 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4509500 
End bp4510954 
Gene Length1455 bp 
Protein Length484 aa 
Translation table11 
GC content72% 
IMG OID641242680 
Productfibronectin type III domain-containing protein 
Protein accessionYP_001508100 
Protein GI158315592 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00993794 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.608011 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATCCGG TTCTCGTCCT GCGGGACACG CGGTTGCAGG TCCGTCCCGG GGACACCGCC 
CGGACCAGCG CGACCGTGCG CAACGCCGGT GATCTCGTCG AGCAGTACGC CCTCGACGTG
CTCGGACCGG CCGCCGCCTG GGCGGAGGTC ATCCCTCCCA CCATCTCGGT GGTCCGCCGC
GGGGAGAGCA CCGTCCAGAT CCTGTTCCGG CCGCCGGTGG GGCCGACGAC CCCGGCGGGC
ACCGTGCCGT TCGCGCTGCG CTGCGTCTCC CGGGAGAACC CGGACAGCGT CGCGATCGCC
GAGGGGGACC TCGCCGTCGG GGCGATCCAC GAGATCGTCG CGTCGGTCAC CCCGGCGGTG
TCCCGCGGCC GGTGGTCCGG CCGGTGGACC GCGCGCTTCG AGAACCGCGG CACAGCACCG
GCCCGGCTGC GGCTGTCCGC CTCCGACGAA CGCCGGACGC TCGGTTTCGC GCTCGCCCCG
GTGGAGCTGG AGATCCCCCC GGGCGAATCC GGTTACGCCT TCCTGAAGGC GCGAGCCGCC
AAGCCGGCGC TGCTCGGTGC GCTGACCCGG CAGCAGGTGC GGCTGACCTA CACCCGGGAG
ACCGCGCCCG ACGAGCCGGT CGCCGAGGGT TTCGTGGATG TCTCCTTCGA ACACGTCCCG
GTGCTGTCCC GGGCGATGAC GACGATCGCC GGCCTCGCCC TCGTGGGCGG AGCCGCCGCC
GTAGTCCTGC TGAGCCAGTC GAGCCCGAAG GACGACACGG CGGCGCCGGG GGCCGCACCA
CCGGCACCGA CCACCTTCTC CGCGGAGACC GGTGACGGCG GTGTCGTCCG CCTGAGCTGG
TCGACGGTCC CCGGGGCGAA GGAGTACGGA ATCCAGAAGC TGGTCGGGGA CGAGGACGTC
GCGCTCGACA CCAAGAGGGT CGACGGGCAG CTGAACGCCT ACGACTTCAC GGGGCTCAAG
GGCGGCGAGC GGACCTGCTT CCGACTGGTG GCGTTCAACG ACTCCGGGGC TTCGCAGCCG
TCCCCGCACG CCTGCGCCAC CGCCGGGATC ACCCCGGAGC CGAGTCCCGG GCCCACGCCG
ACCGGACCCA CCCCGACCGG ACCAACGCCC ACGTCCCCCA CACCCACGCC CGAGACGCCG
AACCCGCAGC CCGGTCCCGG CACGCGGGAG CCGCGGGACG CCTACGTCGT CCTCAGTGTG
TTCGCCAAGG ACGACCAGGT CGCGAACGAC GCGCAGAAAC CTGCACAACG TGCCGGCGAG
ATCGGCTCGG CGCTCGGCGT CGACGTCGTC CTCGCCGACG CCGACAAGTC GACCCGGCTC
TCCGCCCAGT ACCCGGGCTT CCTCGTGATC TACGCGGACC GCTTCGTCAC GCCCGAGGAC
GCCGGGAAGT TCTGCACCGA CATGAGCGAC AAGCTCAGCA CCGTGAAGGC CATCTGCGTG
GCGCAGAACA ACTGA
 
Protein sequence
MDPVLVLRDT RLQVRPGDTA RTSATVRNAG DLVEQYALDV LGPAAAWAEV IPPTISVVRR 
GESTVQILFR PPVGPTTPAG TVPFALRCVS RENPDSVAIA EGDLAVGAIH EIVASVTPAV
SRGRWSGRWT ARFENRGTAP ARLRLSASDE RRTLGFALAP VELEIPPGES GYAFLKARAA
KPALLGALTR QQVRLTYTRE TAPDEPVAEG FVDVSFEHVP VLSRAMTTIA GLALVGGAAA
VVLLSQSSPK DDTAAPGAAP PAPTTFSAET GDGGVVRLSW STVPGAKEYG IQKLVGDEDV
ALDTKRVDGQ LNAYDFTGLK GGERTCFRLV AFNDSGASQP SPHACATAGI TPEPSPGPTP
TGPTPTGPTP TSPTPTPETP NPQPGPGTRE PRDAYVVLSV FAKDDQVAND AQKPAQRAGE
IGSALGVDVV LADADKSTRL SAQYPGFLVI YADRFVTPED AGKFCTDMSD KLSTVKAICV
AQNN