Gene Rcas_4045 details


Gene Information

Locus tag: Rcas_4045
Symbol:
ID: 5541556
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 5246738
End bp: 5249644
Gene Length: 2907 bp
Protein Length: 968 aa
Translation table: 11
GC content: 63%
IMG OID: 640896158
Product: hypothetical protein
Protein accession: YP_001434096
Protein GI: 156743967
COG category: [V] Defense mechanisms
COG ID: [COG0577] ABC-type antimicrobial peptide transport system, permease component
TIGRFAM ID:
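
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions on the replicon and that the final codon is a stop codon that is not counted in the protein length; the function names are illustrative and not part of the record.

```python
# Values copied from the record above.
START_BP = 5246738
END_BP = 5249644


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming one trailing stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 2907
print(protein_length(length_bp))  # 968
```

Both results agree with the Gene Length and Protein Length fields of the record.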


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.107257
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.0242491
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTCGTC AACTGACTCT TGCTCTGGCG TTTCCCTCCG TTATCTGGCG ACGTTTGCTG 
AGCCATCTGG GGCTGGCGCT GGCGGTCTGG TCCGGCATGA CGCTGGCGGT TGGCATGGTG
GTCTGCATTC CGGTCTATGC CGAGGCGTCG GGGTATCGCA TTCTGTTGGC GGCGCTCACC
GAACGAGCCA TCGCCGATCC GCTGCCGCCC TTCGCAATGG TCTACCGTTA CGGCGGCGCA
TCCGACCCCT CGATTTCCTG GCAGCAATAT CTGCTTGCGG ATCAACTGGC GGGGCACCTG
CCGGCTGCGG GGATCGACCT GCCGGCGCCA CCGAGCGTTC GCTTCGCCGC CACGGAAAAA
CTGCGCGTCG GTTTTCCCGA TGGCGCCGGT AGAGAAGTGC TCTTCGCCCG TATAGGGTTT
CTCAGCGGCG TCGAACGGCA CATTCAGATT GTCGATGGCG ATCTGCCGCG ACCGTTCACC
GGCGATGGAC TGCTCGACGT GCTGGTGTCG GAAACGACGG CGTCGAAGAA TACGCTGCTG
GTCGATGATG TGTATCTGGT GCAATCGACC GGTCGCGGAG CGCGTATCGA GGCCCCGGTG
CGGATTGCCG GCATTTGGCG ACCGGCGGAT GCCGACGCGA GTTACTGGTT TCAGCCGCCA
TCGACGTATA GTGATGTGTT TCTTGTGCCG GAAGAGAGTT TCGTCCGGCT CGTCGATGTT
CCCGATGCGC GCTTCGTCAC CCTGACGGCG TGGTACACCG CAATCGATGG CAGCAGTGTG
CGCAGTAGCG ATGTCGAGCG GTTGCGAGAA CGGATTGCGA TGGCGACTGC CGATATTCAG
CAGCGTCTCC CCGGCGCCGA ACTGGTTCGC TCGCCGATGG ATGCGCTGGA GCGGCATCGT
GATCAGGTGC GGGTCTTGAC GGTCACGCTG GCGCTGTTTG CCGTGCCGCT GCTGGTGGTG
ATCGGGTATT TTGCAGTGCA GGTCACCGAA ATGACCGTTG CGCGCCAGCA ACAGGAAATG
GCGGTGCTGC GCAGCCGCGG TAGTTCGCGC TGGCAGGTGC TGGGGCTGGC GCTCGGCGAA
GTGTTGCTGC TCGGCGCTGC CGCATGGGTG GCCGGTCTGC CGCTGGGATG GTTGCTGGCG
CAGTTGATTG CCTGGACGGT CTCGTTTCTC AGATTCGCGC CGCTCGATAT TCCCCCGCCG
ACGTTACTCC CCGCCAGCCC ATGGCACGCG CTGGCGACGA TCCTGCTGGC GCTGCCGGCA
GTGCTTTTGC CTGCGTTGAG CGCCGCCGGG CGCACGATCA TTTCGTACAA GAGCGAACGG
GCGCGCGCCA CGCGCCCGCC CCTCTGGCAA CGGCTCTATC TCGATATTCT TCTGCTGATA
CCGGCGGTCT ATGGTTATCA ACAGTTACGG TTGAGTGGTA TGATCGGCGT GCCTGGCGTG
ACCGTCGGCG CGGATGATCC ATTTCGCAAT CCGCTCCTGC TCCTTGCGCC GGCGCTGATG
GTCTTTGCCG GTACGCTCGT CGGTATGCGC TTCCTGCCGT TGCTGCTGCG CTTGCTGGCA
TGGAGCGCAG GGCGGGCGCC AGGGGTGGCG CTGGTGACGG CGCTGCGCTT TCTGTCGCGC
ACGCCGGGAA CGTATGGCGG ACCGGTGCTC CTGGTTGCGC TGACGCTGGC GCTGGCGACA
TTCACATCAT CGATGGCGCT CACGCTGGAT CGTCATAGCG AAGAACGGGC ATACTACCGG
GGAGCGGCGG ATGTGCGCCT GGCATACCCC GGTGCGGCGA TCACCTCGGC GAATATTGCC
GGCGATCGCG AGATTGCACC GGCGGAAACC AGCCTCGATC TGAGCGGCGG AACGCTGGGT
ACGACGGGAG AAGCAGATAC GACGCCAAGC ACCGCATATA TGTTCGTGCC GATGGAAGAG
TATCTGACCA TCCCAGGGGT GACCGGCGCC ACGCGGGTCG CTCCGAGCAA AGCGGATATC
ATTGTAGGAA ATACGCCTGC GACCGGCGGC ATCTTCTACG GCGTCGACCG GACGACGCTG
GCGGCGGTGC TGGCGGATGC CTGGCGCCCT GACTATGCCG GTGAGTCGCT CGGCGCGTTG
ATGAACCGTC TGGCGGACTA CCCCGACGCA GCCCTGGTGT CGGAAACATT TGCGCGGGAG
CGGGGGTTGC GGATCGGCGA TCGCTTCGCA TTGGCGATGA ATGATCGCGG GCAAACGCAG
AATATCACCT TTACGGTCGT AGGGACGCTC AAGTACTTTC CAACGCTCTA CGATCAGGGT
TTGCCATTCG TGATCGGAAA TCTGGATTAC AGTTTCGACT CACAGGGTGG CCAGTATCCG
TATGAGGTCT GGTTATCGCT GGCGCCTGGC GCAAACCTGC AAAGCGTAGC AGCGGCAGGA
ATCGGGTATG GGTTGCGTAT GATCCGCTCG ACGCCGCAAA CGTTGCTCGA ACTCGATCTG
CGCCGACCGG AGCGCCAGGG TTTGTTCGGC TTGCTCTCGG TGGGTTTTCT GGCGACGACG
GTGGTCACGG TGATCGGGTT CGTTGCCTAT ACCCTGCTCT CGTTCCAGCG ACGGCTGGTG
GAGTTGGGGG TTCTGCGCGC AATTGGGCTG AGCACCCGGC AGTTGAGCGC CCTGCTGGTG
TTCGAGCAGA CGCTGGTGGT TGGGTTTGGT GCGTTGCTCG GCACTGCCGT CGGCATTCTG
ACGAGTCGCT TGTTCATCCC GTTTTTGCAG GTACGCACCG GCGTCTACCC GGATACGCCG
CCGTTCGTGG TGCAGATCGA CTGGGAGCGG ATTGTGCTGG TGTACGGTAT TTCCGGCGGG
CTGCTGGCGG CGACGATTGT CGCCATTGTG CTGCTGCTGC GTCGCATGCG GATTTTCGAA
GCGGTGAAAC TCGGTGAGGC GGTGTAA
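
The gene sequence above is laid out in space-separated blocks of 10 bases, so it has to be cleaned before its length or GC content can be computed. The following is a minimal sketch of that step, with hypothetical helper names; it is shown on the first line of the sequence only, and the same calls should work on the full text.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGATTCGTC AACTGACTCT TGCTCTGGCG TTTCCCTCCG TTATCTGGCG ACGTTTGCTG"
seq = clean_sequence(first_line)
print(len(seq))                     # 60
print(f"{gc_fraction(seq):.1%}")    # GC content of this 60-base window
```

Applied to the complete sequence, `len(seq)` can be compared with the Gene Length field and `gc_fraction(seq)` with the GC content field.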
 
Protein sequence
MIRQLTLALA FPSVIWRRLL SHLGLALAVW SGMTLAVGMV VCIPVYAEAS GYRILLAALT 
ERAIADPLPP FAMVYRYGGA SDPSISWQQY LLADQLAGHL PAAGIDLPAP PSVRFAATEK
LRVGFPDGAG REVLFARIGF LSGVERHIQI VDGDLPRPFT GDGLLDVLVS ETTASKNTLL
VDDVYLVQST GRGARIEAPV RIAGIWRPAD ADASYWFQPP STYSDVFLVP EESFVRLVDV
PDARFVTLTA WYTAIDGSSV RSSDVERLRE RIAMATADIQ QRLPGAELVR SPMDALERHR
DQVRVLTVTL ALFAVPLLVV IGYFAVQVTE MTVARQQQEM AVLRSRGSSR WQVLGLALGE
VLLLGAAAWV AGLPLGWLLA QLIAWTVSFL RFAPLDIPPP TLLPASPWHA LATILLALPA
VLLPALSAAG RTIISYKSER ARATRPPLWQ RLYLDILLLI PAVYGYQQLR LSGMIGVPGV
TVGADDPFRN PLLLLAPALM VFAGTLVGMR FLPLLLRLLA WSAGRAPGVA LVTALRFLSR
TPGTYGGPVL LVALTLALAT FTSSMALTLD RHSEERAYYR GAADVRLAYP GAAITSANIA
GDREIAPAET SLDLSGGTLG TTGEADTTPS TAYMFVPMEE YLTIPGVTGA TRVAPSKADI
IVGNTPATGG IFYGVDRTTL AAVLADAWRP DYAGESLGAL MNRLADYPDA ALVSETFARE
RGLRIGDRFA LAMNDRGQTQ NITFTVVGTL KYFPTLYDQG LPFVIGNLDY SFDSQGGQYP
YEVWLSLAPG ANLQSVAAAG IGYGLRMIRS TPQTLLELDL RRPERQGLFG LLSVGFLATT
VVTVIGFVAY TLLSFQRRLV ELGVLRAIGL STRQLSALLV FEQTLVVGFG ALLGTAVGIL
TSRLFIPFLQ VRTGVYPDTP PFVVQIDWER IVLVYGISGG LLAATIVAIV LLLRRMRIFE
AVKLGEAV
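
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is one way to do that and is not necessarily how the database derived it: it builds the codon assignments of the standard genetic code, which translation table 11 shares for elongation (table 11 differs in the set of permitted start codons, which does not matter here because the gene starts with ATG). All names are illustrative.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 27 bases of the gene sequence in this record.
print(translate("ATGATTCGTC AACTGACTCT TGCTCTGGCG"))  # MIRQLTLALA
print(translate("GCGGTGAAAC TCGGTGAGGC GGTGTAA"))     # AVKLGEAV
```

The two fragments match the first 10 and last 8 residues of the protein sequence listed above.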